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Abstract — MDS array codes are widely used in storage systems 
to protect data against erasures. We address tlie rebuilding ratio 
problem, namely, in the case of erasures, what is the fraction 
of the remaining information that needs to be accessed in order 
to rebuild exactly the lost information? It is clear that when the 
number of erasures equals the maximum number of erasures 
that an MDS code can correct then the rebuilding ratio is 1 
(access all the remaining information). However, the interesting 
and more practical case is when the number of erasures is smaller 
than the erasure correcting capability of the code. For example, 
consider an MDS code that can correct two erasures: What is 
the smallest amount of information that one needs to access in 
order to correct a single erasure? Previous work showed that 
the rebuildmg ratio is bounded between ^ and |, however, the 
exact value was left as an open problem. In this paper, we solve 
this open problem and prove that for the case of a single erasure 
with a 2-erasure correcting code, the rebuilding ratio is j. In 
general, we construct a new family of r-erasure correcting MDS 
array codes that has optimal rebuilding ratio of p in the case of 
e erasures, 1 < e < r. Our array codes have efficient encoding 
and decoding algorithms (for the case r = 2 they use a finite field 
of size 3) and an optimal update property. 



I. Introduction 

Erasure-correcting codes are the basis of the ubiquitous 
RAID schemes for storage systems, where disks correspond 
to symbols in the code. Specifically, RAID schemes are 
based on MDS (maximum distance separable) array codes that 
enable optimal storage and efficient encoding and decoding 
algorithms. With r redundancy symbols, an MDS code is 
able to reconstruct the original information if no more than r 
symbols are erased. An array code is a two dimensional array, 
where each column corresponds to a symbol in the code and 
is stored in a disk in the RAID scheme. We are going to refer 
to a disk/symbol as a node or a column interchangeably, and 
an entry in the array as an element. Examples of MDS array 
codes are EVENODD Q, iH, B-code H, X-code g), RDP 
121, and STAR-code IgJ. 

Suppose that some nodes are erased in a systematic MDS 
array code, we will rebuild them by accessing (reading) some 
information in the surviving nodes, all of which are assumed 
to be accessible. The fraction of the accessed information in 
the surviving nodes is called the rebuilding ratio. If r nodes 
are erased, then the rebuilding ratio is 1 since we need to read 
all the remaining information. Is it possible to lower this ratio 
for less than r erasures? Apparently, it is possible: Figure [T] 
shows an example of our new MDS code with 2 information 
nodes and 2 redundancy nodes, every node has 2 elements. 



and operations are over a finite field of size 3. Consider the 
rebuilding of the first information node, it requires access to 3 
elements out of 6 (a rebuilding ratio of j), because fl = (a + 
b) — b and c = {c + b) — b. In practice, there is a difference 
between erasures of the information (also called systematic) 
and the parity nodes. An erasure of the former will affect 
the information access time since part of the raw information 
is missing, however erasure of the latter does not have such 
effect, since the entire information is accessible. Moreover, in 
most storage systems the number of parity nodes is negligible 
compared to the number of systematic nodes. Therefore our 
constructions focus on the optimally of the rebuilding ratio 
related to the systematic nodes. 




Figure 1. Rebuilding of a (4,2) MDS array code over F3. Assume the first 
node (column) is erased. 

In 0, m, a related problem is discussed: The nodes are 
assumed to be distributed and fully connected in a network, 
and the concept of a repair bandwidtli is defined as the 
minimum amount of data that needs to be transmitted in the 
network in order to rebuild the erased nodes. In contrast to our 
concept of the rebuilding ratio a transmitted element of data 
can be a function of a number of elements that are accessible 
on the same node. In addition, in their general framework, an 
acceptable rebuilding is one that retains the MDS property and 
not necessarily rebuilds the original erased node, whereas, we 
restrict our solutions to exact rebuilding. It is clear that our 
framework is a special case of the general framework, hence, 
the repair bandwidth is a lower bound on the rebuilding ratio. 
What is known about lower bounds on the repair bandwidth? 
In Q it was proved that a lower bound on the repair bandwidth 
for an (n, k) MDS code is: 

M n-1 
k n — k' 



(1) 



Here the code has a total of n nodes with k nodes of 
information and r = n — k nodes of redundancy/parity, where 
A4 is the total amount of information. Also all the surviving 
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Figure!. Permutations for zigzag sets in a (5,3) code witli 4 rows. Columns 
0, 1, and 2 are systematic nodes and columns R, and Z are parity nodes. Each 
element in column R is a linear combination of the systematic elements in the 
same row. Each element in column Z is a linear combination of the systematic 
elements with the same symbol. The shaded elements are accessed to rebuild 
column 1. 



nodes are assumed to be accessible. It can be verified that 
Figure [T] matches this lower bound. Note that Equation ^ 
represents the amount of information, it should be normalized 
to reach the ratio. A number of researchers addressed the 
repair bandwidth problem l7l- |fT6l . however the constructed 
code achieving the lower bound all have low code rate, i.e., 
k/n < 1/2. And it was shown by interference alignment in 
lfT4l . IfTSl that this bound is asymptotically achievable for exact 
repair. 

Instead of trying to construct MDS codes that can be easily 
rebuilt, a different approach ifTTl . ifTSl was used by trying to 
find ways to rebuild existing families of MDS array codes. 
The ratio of rebuilding a single systematic node was shown 
to be I + o(l) for EVENODD or RDP codes H], H), both 
of which have 2 parities. However, based on the lower bound 
of ([T]) the ratio can be as small as 1/2. Moreover, related 
work on constructing codes with optimal rebuilding appeared 
independently in lfT9l . 1201 . Their constructions are similar to 
this work, but only single erasure is considered. 

Our main goal in this paper is to design {n,k) MDS array 
codes with optimal rebuilding ratio, for arbitrary number of 
parities. We first consider the case of 2 parities. We assume 
that the code is systematic. In addition, we consider codes with 
optimal update, namely, when an information element is writ- 
ten, only the element itself and one element from each parity 
node needs update, namely, there is optimal reading/writing 
during writing of information. Hence, in the case of a code 
with 2 parities only 3 elements are updated. Under such 
assumptions, we will prove that every parity element is a linear 
combination of exactly one information element from each 
systematic column. We call this set of information elements a 
parity set. Moreover, the parity sets of a parity node form a 
partition of the information array. 

For example. Figure |2] shows a code with 3 systematic 
nodes and 2 parity nodes. The parity sets corresponding to 
parity node R are the sets of information elements in the same 
row. The parity sets that correspond to the parity node Z are 
the sets of information elements with the same symbol. For 
instance the first element in column is a linear combination 
of the elements in the first row and in columns 0, 1, and 2. 
And the Jjt in column Z is a linear combination of all the 
X elements in columns 0, 1, and 2. We can see that each 
systematic column corresponds to a permutation of the four 
symbols. In general, we will show that each parity relates to a 
set of a permutations of the systematic columns. Without loss 
of generaUty, we assume that the first parity node corresponds 



to identity permutations, namely, it is linear combination of 
rows. In the case of codes with 2 parities, we call the first 
parity the row parity and the second parity the zigzag parity. 
The corresponding sets of information elements for a parity 
element are called row and zigzag sets, respectively. 

It should be noted that in contrast to existing MDS array 
codes such as EVENODD and X-code, the parity sets in our 
codes are not limited to elements that correspond to straight 
lines in the array, but can also include elements that correspond 
to zigzag lines. We will demonstrate that this property is 
essential for achieving an optimal rebuilding ratio. 

If a single systematic node is erased, we will rebuild each 
element in the erased node either by its corresponding row 
parity or zigzag parity, referred to as rebuild by row (or 
by zigzag). In particular, we access the row (zigzag) parity 
element, and all the elements in this row (zigzag) set, except 
the erased element. For example, consider Figure |2] suppose 
that the column labeled 1 is erased, one can access the 8 
shaded elements and rebuild its first two elements by rows, 
and the rest by zigzags. Namely, only half of the remaining 
elements are accessed. It can be verified that for the code in 
Figure |2] all the three systematic columns can be rebuilt by 
accessing half of the remaining elements. Thus the rebuilding 
ratio is 1/2, which is the lower bound expressed in ([TJ. 

The key idea in our construction is that for each erased 
node, the row sets and the zigzag sets have a large inter- 
section - resulting in a small number of accesses. So the 
question is: How do we find permutations such that the row 
sets and zigzag sets intersect as much as possible? In this 
paper, we will present an optimal solution to this question 
by constructing permutations that are derived from binary 
vectors. This construction provides an optimal rebuilding ratio 
of 1 /2 for any erasure of a systematic node. How do we define 
permutations on integers from a binary vector? We simply add 
to each integer the binary vector and use the sum as the image 
of this integer Here each integer is expressed as its binary 
expansion. For example, in order to define the permutation 
on integers {0,1,2,3} from the binary vector v = (1,1), 
we express each integer in binary: (0,0), (0,1), (1,0), (1,1)- 
Then add (mod 2) the vector v = (1,1) to each integer, 
and get (1, 1), (1,0), (0, 1), (0,0). At last change each binary 
expansion back to integer and define it as the image of the per- 
mutation: 3, 2, 1, 0. Hence, (0, 1, 2, 3) are mapped to (3, 2, 1, 0) 
in this permutation, respectively. This simple technique for 
generating permutations is the key in our construction. We can 
generalize our construction for arbitrary r (number of parity 
nodes) by generating permutations using r-ary vectors. Our 
constructions are optimal in the sense that we can construct 
codes with r parities and a rebuilding ratio of 1 /r. 

So far we focused on the optimal rebuilding ratio, however, 
a code with two parity nodes should be able to correct two 
erasures, namely, it needs to be an MDS code. We will present 
that for large enough field size the code can be made MDS. 
In particular, another key result in this paper is that for the 
case of a code with two parity nodes, the field size is 3, and 
this field size is optimal. 

In addition, our codes have an optimal array size in the 
sense that for a given number of rows, we have the maximum 
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number of columns among all systematic codes with optimal 
ratio and update. However, the length of the array is expo- 
nential in the width. We introduce techniques for making the 
array wider while having a rebuilding ratio that is very close 
to 1/r. 

We also considered the following generalization: Suppose 
that we have an MDS code with three parity nodes, if we have 
a single erasure, using our codes, we can rebuild the erasure 
with rebuilding ratio of 1/3. What happens if we have two 
erasures? What is the rebuilding ratio in this case? Our codes 
can achieve the optimal rebuilding ratio of 2/3. In general, if 
we have r > 3 parity nodes and e erasures happen, 1 < e < r, 
we will prove that the lower bound of repair bandwidth is e/r 
(normalized by the size of the remaining array), and so is the 
rebuilding ratio. And the code we constructed achieves this 
lower bound for any e. 

In summary, the main contribution of this paper is the first 
explicit construction of systematic {n,k) MDS array codes for 
any constant r = n — k, which achieves optimal rebuilding 
ratio of j. Moreover, our codes achieve optimal rebuilding 
ratio of j when e systematic erasures occur, 1 < e < r. 
The parity symbols are constructed by linear combinations 
of a set of information symbols, such that each information 
symbol is contained exactly once in each parity node. These 
codes have a variety of advantages: 1) they are systematic 
codes, hence it is easy to retrieve information; 2) they have 
high code rate k/ n, which is commonly required in storage 
systems; 3) the encoding and decoding of the codes can be 
easily implemented (for r = 2, the code uses finite field of size 
3); 4) they match the lower bound of the ratio when rebuilding 
e systematic nodes; 5) the rebuilding of a failed node requires 
simple computation and access to only 1 /r of the data in each 
node (no linear combination of data); 6) they have optimal 
update, namely, when an information element is updated, only 
r + 1 elements in the array need update; and 7) they have 
optimal array size. 

The remainder of the paper is organized as follows. Section 

constructs (fc + 2, k) MDS array codes with optimal rebuild- 
ing ratio. Section|lII]gives formal definitions and some general 
observations on MDS array codes. Section HV] introduces code 
duplication and thus generates {k + 2, k) MDS array codes for 
arbitrary number of columns. We discuss the size of the finite 
field needed for these constructions in Section [V] Decoding 
algorithms for erasures and errors are discussed in Section 
fVll Section IVIII generalizes the MDS code construction to 
arbitrary number of parity columns. These generalized codes 
have properties that are similar to the {k + 2, k) MDS array 
codes, likewise some of them has optimal rebuilding ratio. 
Rebuilding of multiple erasures and generalization of the 
rebuilding algorithms are presented in Section IVIIII Finally 
we provide concluding remarks in Section |IX] 

II. {k + 2, k) MDS ARRAY CODE CONSTRUCTIONS 

In the rest of the paper, we are going to use to denote 
{i,i + 1, . . . , ]} and [i] to denote {1,2, ...,/}, for integers 
/ < And denote the complement of a subset X C M as 
X = M\X. For a matrix A, denotes the transpose of 



A. For a binary vector v = (uj, we denote by iJ = 
{vi + 1 mod 2,...,v„ + 1 mod 2) its complement vector 
The standard vector basis of dimension m will be denoted 
as {e,}"!^ and the zero vector will be denoted as eg- For an 
integer n denote by S„ the set of permutations over the integers 
[0, n — 1], namely the symmetric group. For two functions /, g, 
denote their composition by fg or f o g. 

Recall that rebuilding ratio is the average fraction of ac- 
cesses in the surviving systematic and parity nodes while 
rebuilding one systematic node. More specific definition will 
be given in the next section. In this section we give the 
construction of MDS array code with two parities and optimal 
rebuilding ratio 1/2 for one erasure, which uses an optimal 
finite field of only size 3. 

We mentioned in the introduction that each {k + 2,k) MDS 
array code with optimal update can be constructed by defining 
the row and zigzag parities (proofs are given in Section 
Uni l. More specifically, the row parity corresponds to identity 
permutation in each systematic column, and the zigzag parity 
corresponds to a set of permutations {fo,fi, ■ ■ ■ / A-i} for the 
systematic columns {0, 1, . . . ,k}. From the example in Figure 
ID we know that in order to get low rebuilding ratio, we need 
to find fo,---,fk-i such that the row and zigzag sets used in 
rebuilding intersect as much as possible. In addition, since 
each parity element is a linear combination of elements in 
its parity set, we need to define the coefficients of the linear 
combination such that the code is MDS. Noticing that all 
elements and all coefficients are from some finite field, we 
would like to choose the coefficients such that the finite field 
size is as small as possible. So our construction of the code 
includes two steps: 

1) Find zigzag permutations to minimize the ratio. 

2) Assign the coefficients such that the code is MDS. 
The following construction constructs a family of MDS 

array codes with 2 parities using binary vectors. From any 
set T C F^", |T| = fc, we construct a {k + 2,k) MDS array 
code of size 2™ x (fc + 2). We will show that some of these 
codes have the optimal ratio of j. 

In this section all the calculations are done over F2. By 
abuse of notation we use x G [0,2'" — 1] both to represent the 
integer and its binary representation. It will be clear from the 
context which meaning is in use. 

Construction 1 Let A = (ai^j) be an array of size 2"' x k for 
some integers k, m and k < 2™. Let T C ¥2 be a set of vectors 
of size k wliich does not contain the zero vector. For c £ T 
we define the permutation fy : [0,2™ - 1] ^ [0,2™ - 1] by 
fv{x) = X + V, where x is represented in its binary represen- 
tation. One can clieck that tliis is actually a permutation. For 
example when m = 2, v = (1, 0), x = 3, 

/(i,o) (3) = 3 + (1, 0) = (1, 1) + (1, 0) = (0, 1) = 1. 

One can check that the permutation fi, in vector notation is 
[2,3,0,1]. In addition, we define Xy = {x e [0,2'" - 1] : 
X ■ V = 0} as the set of integers orthogonal to v. For example, 
■^(10) ~ {0/1}- The construction of the two parity columns 
is as follows: The first parity is simply the row sums. The 
second parity is the linear combination of elements in the 
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zigzag set. The zigzag sets Zq, Z2ih_i are defined by the 
permutations {fy. : Vj G T} as G Z; if fy.{i) = 1. We 
will denote the permutation fi,. as fj and the set Xyj as Xj. 
Assume column j is erased, and define S,- = {fl,- ^ : / G X^ } 
and Sz = {cii^j : / ^ X^}. Rebuild the elements in S,- by rows 
and the elements in Sz by zigzags. 

Theorem 1 Construct permutations /o,...,/m and sets 
Xq, X,„ by the vectors {£i}"LQ as in Construction\l\ where 
Xo is modified to be Xq = {x e IP^ : x ■ (1,1,. ..,1) = 0}. 
Then the corresponding (m + 3, m + 1) code has optimal ratio 
of\. 

Before proving the theorem, we first give an example. 
Actually, this example is the code in Figure [2] with more 
details. 

Example 1 Let A be an array of size 4x3. We construct a 
(5,3) MDS array code for A as in Theorem [T] that accesses 
J of the remaining information in the array to rebuild any 
systematic node (see Figure^. For example, Xi = {0, 1}, and 
for rebuilding of node 1 (column Ci) we access the elements 
flO,0/ ^0,2/ '^1,0/ '^1,2' and the following four parity elements 

ro = Co,0 + «0,l + «0,2 

n = fli,o + + "1,2 

2/i{2) = ZO = «0,0 + 2fl2,l + 2fll,2 
^/i(3) = = «1,0 + 2fl3,l + ^0,2- 

It is trivial to rebuild node 1 from the accessed information. 
Note that each of the surviving node accesses exactly j of 
its elements. It can be easily verified that the other systematic 
nodes can be rebuilt the same way. Rebuilding a parity node is 
easily done by accessing all the information elements. 

In order to prove Theorem [U we first prove the following 
lemma. We use a binary vector to represent its corresponding 
systematic node. And define \v\u\ = Y^i:vf=i,Uj=0^ 
number of coordinates at which v has a 1 but u has a 0. 

Lemma 2 (i) For any v,u ^ T, to rebuild node v, the number 
of accessed elements in node u is 

2'«^i + |/„(x-,)n/„(x„)|. 

(ii) For any ^ v,u E T, 



\fvix-o)nfu{x^,) 




mod 2 = 
mod 2 = 1. 



(2) 



Proof: (i) In rebuilding of node v we rebuild the elements 
in rows Xy by rows, thus the row parity column accesses 
the values of the sum of rows Xj,. Moreover, the surviving 
node u also accesses its elements in rows X^. Hence, by now 
\Xy \ = 2'"-! elements are accessed. The elements of node v in 
rows X-a are rebuilt by zigzags, thus the zigzag parity column 
accesses the values of the zigzags sums : / G X^}, and 

each surviving systematic node accesses the elements of these 
zigzags from its column, unless these elements are already 
included in the rebuilding by rows. The zigzag elements in 
{Zjr_^(^fj : / G Xj;} of node u are in rows fu^{fv{Xy)), where 



/„ is the inverse function of /j,. Thus the extra elements node 
M needs to access are in rows /j7^ (/zj(Xp))\Xp. But, 

\ fu\fv{^v)) \^v\ 

= \ f-\f,{x,))r\X-, \ 

= |/,7'(/.(x,))ux,| 

= 2"'-|/-i(/„(x,))ux,| 

= 2"' - {\f-\fv{x,))\ + \x,\ - \f-\fv{x,))r^x,\) 

= \fu\fv{^v))r\x,\ 

= |/.(x,)n/„(x-,)|, 

where we used the fact that fv,fu are bijections, and |Xz;| = 

211! — 1 

(ii) Consider the group (F^", +). Recall that /„(X) = X + 
V = {x + V : X E X}. The sets fv{Xj,) = Xj, + v and 
/h(Xi,) = Xi, + u are cosets of the subgroup Xy = {w G 
W2 : w ■ V — 0}, and they are either identical or disjoint. 
Moreover, they are identical iff — m G Xy, namely (v — 
11) ■ V = Y^i;v.=\^ii.=Q 1 = mod 2. However, by definition 
\v\u\ = 'Yli vi=l,ui=^^ mod 2, and the result follows. ■ 

Let {/o/ ■•■///c-i} be a set of permutations over the set 
[0,2™ - 1] with associated subsets Xq, Xfc_i C [0,2™ - 1], 
where each |X,| = 2™^^. We say that this set is a set of 
orthogonal permutations if for any /,/ G [0, fc — 1], 



|/Kx,)n/,(X0| 
9m— 1 



where ^5, ^ is the Kronecker delta. For a set of orthogonal 
permutations, in order to rebuild any systematic node, only 
2'"^^ elements are accessed from each surviving systematic 
node by Lemma|2] And only 2™"^ elements are accessed from 
each parity node, too. Hence codes generated by orthogonal 
permutations has optimal rebuilding ratio 1/2. Now we are 
ready to prove Theorem [T] 

Proof of Theorem^ Since |ei\ej| = 1 for any i ^ j ^ 0, 
we get by lemma |2] 

/Kx,)n/^-(x,) = 0. 

Now consider e,- and bq, for i 7^ 0. Note that fi{Xi) = {x + e, : 
X ■ e; = 0} = {y : y ■ £/ = 1}, so 

/,(X,) n/o(X,) = {y : y ■ e, = 1} n {x : X ■ e,- = 0} = 0. 

Similarly, /,(Xo) = {x + e,- : x • (1, 1, . . . , 1) = 0} = {y : 
y. (1,1,. . . ,1) = 1}, and 

/o(Xo)n/,(Xo) 
= {x:x-(l,---,l)=0}n{y:y(l,---,l) = l} 
= 0. 

Hence the permutations /o, ...,/m are orthogonal permuta- 
tions, and the ratio is 1 /2. ■ 

Note that the optimal code can be shortened by removing 
some systematic columns and still retain an optimal ratio, i.e., 
for any /c < m + 1 we have a code with optimal rebuilding. 

Having found the set of orthogonal permutations, we need 
to specify the coefficients in the parities such that the code is 
MDS. 
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Figures, (a) The set of orthogonal permutations as in Theorem [T] with sets Xq = {0,3}, Xj = {0,1}, X2 = {0,2}. (b) A (5,3) MDS an'ay code generated 
by the orthogonal permutations. The first parity column C3 is the row sum and the second parity column C4 is generated by the zigzags. For example, zigzag 
Zo contains the elements that satisfy ~ 0. 



Consider the {m + 3,m + 1) code C constructed by Theo- 
rem[T]and the vectors {e,}"lg- Let F be the finite field we use. 
Let the information in row i, column j be a, ^ G F. Let its row 
and zigzag coefficients be a, y, /3, y G F. For a row set R„ = 
{au,0,au,l,---,au,m}, the row parity is r„ = TI[=o<^u,jau,j- 
For a zigzag set Z„ = {fl„,0/ • ■ • / ai(+e,„,m}, the zigzag 

parity is z„ = Ejlg ^u+ej,ja^+ej,y 

Recall that the {m + 3,ni + 1) code is MDS iff we can 
recover the information from up to 2 columns erasures. It 
is clear that none of the coefficients ajj,^jj can be zero. 
Moreover, if we assign all the coefficients as a/y = = 1 
we get that in an erasure of two systematic columns the set 
of equations derived from the parity columns are linearly 
dependent and thus not solvable (the sum of the equations from 
the row parity and the sum of those from the zigzag parity will 
both be the sum of the entire information array). Therefore 
the coefficients need to be from a field with more than 1 
nonzero element, thus a field of size at least 3 is necessary. 
The construction below surprisingly shows that in fact F3 is 
sufficient. 

Construction 2 For the code C in Theorem[T]over F3, define 
Uj = I];^o ^' < /' < m. Assign row coefficients as OLi^j = 
1 for all and zigzag coefficients as 

where i = (z'l, . . . , f,,,) is represented in binary and tfie calcula- 
tion in the exponent is done over F2. 

The coefficients in Figure [3] are assigned by Construction 
121 The following theorem shows that the code is MDS. 

Theorems Construction^ is an [m + 3,m + \) MDS code 
with optimal finite field size of 3. 

Proof: It is easy to see that if at least one of the two 
erased columns is a parity column then we can recover the 
information. Hence we only need to show that we can recover 
from any erasure of two systematic columns. In an erasure 
of two systematic columns i,j G [0, m],f < j, we access the 
entire remaining information in the array. For r G [0, 2'" — 1] 
set r' = r + e; + ej, and recall that fl^,/ G Z; iff Z = r + thus 
^r,ii^r',j G -Zr+e, and ayyUyi i G 'Z.y+ej- From the two parity 
columns we need to solve the following equations (for some 
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This set of equations is solvable iff 

Note that the multiplicative group of F3\0 is isomorphic to 
the additive group of F2, hence multiplying two elements in 
F3\0 is equivalent to summing up their exponent in F2 when 
they are represented as a power of the primitive element of the 
field F3. For columns < i < j < m and rows r,r' defined 
above, we have 

. ^ 2'''"'+''''"' = 2(''+''')'"' = 2('''+''7)'^'=o'^' = 2*^? = 2. 

However in the same manner we derive that 

^,/,,,^y = 2('-+'-')-"/ = 2i''+'iy^,=o'i = 2'-^'i =2° = 1. 

Hence Q is satisfied and the code is MDS. ■ 
Remark: The above proof shows that , 7^ fi^r and 
[ir,j = ^r',j for i < i- And (l3]i is a necessary and sufficient 
condition for a MDS code for vectors c, 7^ Vj. 

In addition to optimal ratio and optimal field size, we will 
show in the next section that the code in Theorem [T] is also 
of optimal array size, namely, it has the maximum number of 
columns, given the number of rows. 

III. Formal Problem Settings and Constructions 

In this section, we first give some observations of an 
arbitrary MDS array code with optimal update. Then we 
prove some properties and give some examples of our code in 
Construction [T] 

Let us define an MDS array code with 2 parities. Let 
A = [cii^j) be an array of size p x k over a finite field 
F, where / G [0, p — 1],/ G [0,k — 1], and each of its 
entry is an information element. We add to the array two 
parity columns and obtain an (n = k + 2,k) MDS code of 
array size p x n. Each element in these parity columns is 
a linear combination of elements from A. More specifically, 
let the two parity columns be Cj- = (rg, rj, rp_i)-'^ and 
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Q+1 = (zo,Zi...,Zp_i)^. Let R = {Ro/Ki/-/i?;,-i} and 
Z = {Zq, Zx, Zp_i} be two sets such that -R;,Z; are 
subsets of elements in A for all / E [0, p — 1]. Then for all 
/ e [0, p - 1], define r, = E„6R, ««« and z, = J^^^^z, ^aa, for 
some sets of coefficients {eta}, {jSa} C F. We call R and Z 
as the sets that generate the parity columns. 

We assume the code has optimal update, meaning that only 3 
elements in the code are updated when an information element 
is updated. Under this assumption, the following theorem 
characterizes the sets R and Z. 

Theorem 4 For an [k + 2, k) MDS code with optimal update, 
ttie sets R and Z are partitions of A into p equally sized sets of 
size k, where each set in R or Z contains exactly one element 
from each column. 

Proof: Since the code is a {k + 2, k) MDS code, each 
information element should appear at least once in each parity 
column C/f, C/^^i. However, since the code has optimal update, 
each element appears exactly once in each parity column. 

Let X E R, note that if X contains two entries of A from the 
systematic column Q, i E [0,k — l], then rebuilding is impos- 
sible if columns C, and Q+i are erased. Thus X contains at 
most one entry from each column, therefore | X | < k. However 
each element of A appears exactly once in each parity column, 
thus if |X| < /c, X e R, there is Y e R, with |Y| > k, which 
leads to a contradiction. Therefore, \X\ = k for all X E R. 
As each information element appears exactly once in the first 
parity column, R = {Rg, . . . , Rp_i} is a partition of A into 
p equally sized sets of size k. Similar proof holds for the sets 
Z = {Zq, . . .,Zp_i}. ■ 

By the above theorem, for the ;-th systematic column 
((Tq/ • ■ • /flp-i)"'', its p elements are contained in p distinct 
sets R;, Z G [0, p — 1]. In other words, the membership of the 
j-th column's elements in the sets {R/} defines a permutation 
gj : [0, p - 1] -> [0, p - 1], such that gj{i) = I iff a,- E R;. 
Similarly, we can define a permutation fj corresponding to 
the second parity column, where fj{i) = I iff fl, E Z;. For 
example, in Figure [2] each systematic column corresponds to 
a permutation of the four symbols. 

Observing that there is no importance of the elements' 
ordering in each column, w.l.o.g. we can assume that the first 
parity column contains the sum of each row of A and gj's 
correspond to identity permutations, i.e. r,- = I^^Zq 
for some coefficients We refer to the first and the 

second parity columns as the row parity and the zigzag parity 
respectively, likewise R; and Z;, / E [0, p — 1], are referred 
to as row sets and zigzag sets respectively. We will call fj, 
i E [0, A: — 1], zigzag permutations. By assuming that the first 
parity column contains the row sums, we want to (1) find 
zigzag permutations to minimize the rebuilding ratio; and (2) 
assign the coefficients such that the code is MDS. 

First we show that any set of zigzag sets Z = 
{Zq, Zp_i} defines a {k + 2,k) MDS array code over a 
field F large enough. 

Theorems Let A = (aj^j) be an array of size p x k and the 
zigzag sets be Z = {Zq, Zp^i}, then there exists a {k + 2, k) 
MDS array code for A with Z as its zigzag sets over the field F 
of size greater than p{k— 1) + 1. 



The proof is shown in Appendix [A] The above theorem 
states that there exist coefficients such that the code is MDS, 
and thus we will focus first on finding proper zigzag per- 
mutations {fj}- The idea behind choosing the zigzag sets 
is as follows: assume a systematic column (aq/ ■■•/ ^Jp-i)^ 
is erased. Each element a, is contained in exactly one row 
set and one zigzag set. For rebuilding of element a,, access 
the parity of its row set or zigzag set. Moreover access the 
values of the remaining elements in that set, except a,-. We 
say that an element fl, is rebuilt by a row (zigzag) if the 
parity of its row set (zigzag set) is accessed. For example, 
in Figure |2] supposing column 1 is erased, one can access the 
shaded elements and rebuild its first two elements by rows, and 
the rest by zigzags. The set S = {Sq, Si, Sp_i} is called 
a rebuilding set for column {aQ,ai, ...,ap^iY if for each i, 
S, G R U Z and fl, E S,. In order to minimize the number of 
accesses to rebuild the erased column, we need to minimize 
the size of 

lufJoS^I, (4) 

which is equivalent to maximizing the number of intersections 
between the sets {S,}[Lg . More specifically, the intersections 
between the row sets in S and the zigzag sets in S. 

For a [k + 2, k) MDS code C with p rows define the 
rebuilding ratio R{C) as the average fraction of accesses in 
the surviving systematic and parity nodes while rebuilding one 
systematic node, i.e., 

^ Lj mingp Sp_^ rebuilds; I ULq^ 5/ 

^ ' ~ p{k + l)k 

Notice that in the two parity nodes, we access p elements 
because each erased element must be rebuilt either by row 
or by zigzag. And u|'^q S,- contains p elements in the erased 
column. Thus the above expression is exactly the rebuilding 
ratio. Define the ratio function for all {k + 2, k) MDS codes 
with p rows as 

R{k) = mm R(C), 

which is the minimal average portion of the array needed to 
be accessed in order to rebuild one erased column. 

Theorem 6 R{k) is no less than j and is a monotone nonde- 
creasing function. 

The proof is given in Appendix IB] For example, the code in 
Figure |3] achieves the lower bound of ratio 1/2, and therefore 
R(3) = 1/2. Moreover, We will see in Corollary [TO] that R{k) 
is almost 1/2 for all k and p = 2™, where m is large enough. 

So far we have discussed the characteristics of an arbitrary 
MDS array code with optimal update. Next, let us look at our 
code in Construction [T] 

Recall that by Theorem |5] this code can be an MDS code 
over a field large enough. The ratio of the constructed code 
will be proportional to the size of the union of the elements in 
the rebuilding set in (|4]i. The following theorem gives the ratio 
for Construction [T] and can be easily derived from Lemma |2] 
part (i). 
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Theorem 7 The code described in Construction [T] and gener- 
ated by the vectors Vo,Vi, i^/t-i is a {k + 2,k) MDS array 
code with ratio 



(5) 



2'«fc(fc + l) 

Next we show the optimal code in Theorem [T] is optimal in 
size, namely, it has the maximum number of columns given 
the number of rows. 

Theorem 8 Let F be an orthogonal set of permutations over the 
integers [0, 2"' — 1] , then the size ofF is at most m + 1. 

Proof: We will prove it by induction on ni. For m = 
there is nothing to prove. Let F = {fo, fi, fk-i} be a 
set of orthogonal permutations over the set [0,2'" — 1]. We 
only need to show that \F\ = /c < 77? + 1. It is trivial to 
see that for any permutations g,h on [0,2™ — 1], the set 
hFg = {hfogfhfig, ...,hfi^_^g} is also a set of orthogo- 
nal permutations with sets g~^{XQ),g~^[X-[), ...,g^^[Xi^_i). 
Thus w.l.o.g. we can assume that /o is the identity permutation 
and Xq = [0,2™^^ — 1]. From the orthogonality we get that 

ullMXo) = X-o = [2"'-\2'"-l]. 

We claim that for any i ^ 0, |X, nXo| = = 2^-2. 
Assume the contrary, thus w.l.o.g we can assume that |X/ D 
Xol > 2""-^, otherwise |X; n X^| > T"-^. For any j ^i^O 
we get that 



/^■(x,nXo),/,(x,nXo) cxo, 
|/y(x,-nXo)| = |/,(x,-nXo)|>2'"-2 = 



iXn 



(6) 



(7) 



From equations ^ and (|7]i we conclude that fj{Xj n Xq) fl 
fi{Xj n Xq) 7^ 0, which contradicts the orthogonality prop- 
erty. Define the set of permutations F* = {/f jj^Zj over the 
set of integers [0,2'"-l -1] by =//(x) -2'""^ which 
is a set of orthogonal permutations with sets {X,- D Xol^^-j*^. 
By induction k — 1 < m and the result follows. ■ 
The above theorem implies that the number of rows has to 
be exponential in the number of columns in any systematic 
code with optimal ratio and optimal update. Notice that the 
code in Theorem [T] achieves the maximum possible number of 
columns, m + 1. Besides, an exponential number of rows is 
still practical in storage systems, since they are composed of 
dozens of nodes (disks) each of which has size in an order 
of gigabytes. In addition, increasing the number of columns 
can be done using duplication (Theorem |9l) or a larger set of 
vectors (the following example) with a cost of a small increase 
in the ratio. 

Example 2 Let T = {v e W'^ : \\v\\i = 3} be the set of 
vectors with weight 3 and length m. Notice that \T\ = ('3). 
Construct the code C by T according to Construction\l\ Given 
V e T, \{u e T : \v\u\ = 3}| = ('"3^), which is the number 
of vectors with 1 's in different positions as v. Similarly, \ {u G 
T : \v\u\ = 2}\ = 3('"2^) and \{u e T : \v\u\ = 1}\ = 
3{m — 3). By Theorem^ and Lernma|2] for large m the ratio is 



1 I 2'"-i(^)3( 

2 2'"(™)((™) 



m-3\ 
2 I 



+ 1) 



_9_ 
2m' 



Note that this code reaches the lower bound of the ratio as 
m tends to infinity, and has 0{w?) columns. 

IV. Code Duplication 

In this section, we are going to duplicate the code to increase 
the number of columns in the constructed (fc + 2, k) MDS 
codes, such that k does not depend on the number of rows, 
and ratio is approximately Then we will show the optimality 
of the duplication code based on the standard basis. 

Let C be a (A: + 2, k) array code with p rows, where the 
zigzag sets {Z;}|'^J are defined by the set of permutations 
{fi}\ZQ on [0, p — 1]. For an integer s, an s-duplication code 
C' is an [sk + 2, sk) MDS code with zigzag permutations 
defined by duplicating the k permutations s times each, and 
the first parity column is the row sums. In order to make the 
code MDS, the coefficients in the parities may be different 
from the code C. For an s-duplication code, denote the column 
corresponding to the t-th fj as column < t < s — 1. 
Call the columns : j E [0,k - 1]} the f-th copy of the 
original code. An example of a 2-duplication of the code in 
Figure [3] is illustrated in Figure |4] 

Theorem 9 If a {k + 2,k) code C has ratio R{C), then its s- 
duplication code C' has ratio R[C){1 + -^^^pi)- 

Proof: We propose a rebuilding algorithm for C with 
ratio of R(C)(1 + ^q^), which will be shown to be optimal. 

Suppose in the optimal rebuilding algorithm of C, for 
column f, elements of rows / = {/i,/2/ ■ • ■ //»} ^"e rebuilt 
by zigzags, and the rest by rows. In C, all the s columns 
corresponding to /, are rebuilt in the same way: the elements 
in rows / are rebuilt by zigzags. 

W.l.o.g. assume column is erased. Since column r 
t G [l,s — 1] corresponds to the same zigzag permutation 
as the erased column, for the erased element in the /-th 
row, no matter if it is rebuilt by row or by zigzag, we 
have to access the element in the /-th row and column /(^^ 
(e.g. permutations /d'^^,/o^^ and the corresponding columns 

in Figure H]). Hence all the elements in column 
must be accessed. Moreover, the optimal way to access the 
other surviving columns can not be better than the optimal 
way to rebuild in the code C. Thus the proposed algorithm 
has optimal rebuilding ratio. 

When column z'*^) is erased, the average (over all i E 
[0,k — 1]) of the number of elements needed to be accessed 
in columns for all / e [0,A: - 1], / 7^ i and t € [0,s - 1] 
is 

R{C)p{k + l)-p. 

Here the term —p corresponds to the access of the parity nodes 
in C. Moreover, we need to access all the elements in columns 
i^'\0 < t < s — 1, and access p elements in the two parity 
columns. Therefore, the rebuilding ratio is 

s(R(C)p(fc+ 1) - p) + (s - l)p + p 



RiC) 



p(sfc + l) 



R{C) 



s{k + l) 



R{C){1 



sk + 1 
s - 1 



sk + 1' 



8 



Figure4. A 2-duplication of the code in Figure[3] The code has 6 information nodes and 2 parity nodes. The ratio is 4/7. 



and the proof is completed. ■ 
Theorem |9] gives us the ratio of the s- duplication of a code 
C as a function of its ratio R{C). As a result, for the optimal- 
ratio code in Theorem [T] the ratio of its duplication code is 
slightly more than 1/2, as the following corollary suggests. 

Corollary 10 The s-duplication of the code in Theorem [T] has 
ratio 1(1 + which is \ + for Jarge s. 

For example, we can rebuild the column l'*^' in Figure 
|4]by accessing the elements in rows {0,1} and in columns 
0(0),2(°),0(l),2(l),R,Z, and all the elements in column \^^\ 
The rebuilding ratio for this code is 4/7. 

Using duplication we can have arbitrarily large number of 
columns, independent of the number of rows. Moreover the 
above corollary shows that it also has an almost optimal ratio. 

Next we will show that if we restrict ourselves to codes con- 
structed using Construction [T] and duplication, the code using 
the standard basis and duplication has optimal asymptotic rate. 

In order to show that, we define a related graph. Define the 
directed graph = Dm(y,E) as V = {w eW^' :w ^ 0}, 
and E = {{wi,W2) : |jx'2\ii'i| = 1 mod 2}. Hence the 
vertices are the nonzero binary vectors of length m, and 
there is a directed edge from lUi to IU2 if |ii'2\'^l| is odd 
size. From any induced subgraph H of D,„, we construct 
the code C{H) from the vertices of H using Construction 
[U By Lemma |2] we know that a directed edge from zui to 
W2 in H means fw^{Xi„^) n fwii^Wi) = 0. so only half 
of the information from the column corresponding to w\ is 
accessed while rebuilding the column corresponding to W2. 
For a directed graph D = D{V,E), let S and T be two 
disjoint subsets of its vertices. We define the density of the 
set S to be dg = j|p and the density between S and T to be 

dgj = 2^s\'\T\ ' '^here Eg is the number of edges with both of 
its endpoints in S, and £5 7 is the number of edges incident 
with a vertex in S and a vertex in T. The following theorem 
shows that the asymptotic ratio of any code constructed using 
Construction [T] and duplication is a function of the density of 
the corresponding graph H. 

Theorem 11. Let H be an induced subgraph ofD,„. LetCs{H) 
be the s-duplication of the code constructed using the vertices 



ofH and Construction\l\ Then the asymptotic ratio ofCs{H) is 
limR{Cs{H)) = l-^ 

Proof: Let the set of vertices and edges of H be V{H) = 
{vi} and E(H) respectively. Denote by v\, Vi e V{H),l E 
[0, s — 1], the Z-th copy of the column corresponding to the 
vector Vi. In the rebuilding of column vl, I E [0, s — 1] each 
remaining systematic column v^, k E [0, s — 1], needs to access 
all of its 2™ elements unless \vj\vj\ is odd, and in that case 
it only has to access 2"'^^ elements. Hence the total amount 
of accessed information for rebuilding this column is 

(s|V(H)|-l)2'"-deg+(i;,)s2™-\ 

where deg^ is the indegree of Vj in the induced subgraph H. 
Averaging over all the columns in Cs{H) we get the ratio: 

mm) 

_E,/gc,(H)(s|^(-^)l - 1)2" - deg+(z;,)s2'"-i 

~ s|y(H)|(s|y(H)| + 1)2'" 

_5|V(H)|(s|V(H)|-l)2'"-52E.,gv(H)deg+K)2'"-i 

s|V(H)|(s|V(H)| + l)2'« 
_ s\V{H)\{s\V{H)\ -1)2'" -s^|E(H)|2'"-i 
~ s|y(H)|(s|V(H)| +1)2'" • 

Hence 

lim K(C,(H)) = 1 - = 1 - ^. 

■ 

We conclude from Theorem [TT| that the asymptotic ratio of 
any code using duplication and a set of binary vectors {c;} is a 
function of the density of the corresponding induced subgraph 
of Dffi with {vi} as its vertices. Hence the induced subgraph 
of Dm with maximal density corresponds to the code with 
optimal asymptotic ratio. It is easy to check that the induced 
subgraph with its vertices as the standard basis {eij'JL^ has 
density In fact this is the maximal possible density 

among all the induced subgraph as Theorem [13] suggests, but 
in order to show it we will need the following technical lemma. 
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Lemma 12 Let D = D{V,E) be a directed graph and S, T be 
a partition ofV, i.e., S HT = (Z>, S U T = V, then 

dy < max{ds, dj, dg 7} 
Proof: Note that dy = \S\'ds+\T\'dr+2\S\\T\ds,T _ ^^^ ^ 
assume that > dj therefore if dg > Dg j, 
_ \S\^ds + \T\^dT + 2\S\\T\dsj 



< 



|y|2 

\S\^ds + \T\^ds - \T\^ds + \T\^dT + 2\S\\T\ds 

\W 



_ ds(|S| + |T|)2-|r|2(ds-dr) 

|y|2 

< ds. 

If dsj > max{ds,dj} then, 
dy 



S\^ds + |r|2dT + 2|S||r|ds,r 



< 



|V|2 

|S|2ds,T+ |T|2ds,T + 2|S||r|ds,T 



and the result follows. ■ 
Now we are ready to prove the optimality of the duplication 
of the code using standard basis, if we assume that the number 
of copies s tends to infinity. 

Theorem 13 For any induced subgraph H of Dm, d^ < 
So the optimal asymptotic ratio among all codes constructed us- 
ing duplication and Constructioi^is 5(1 + ^) and is achieved 
using the standard basis. 

Proof: We say that a binary vector is an even (odd) vector 
if it has an even (odd) weight. For two binary vectors Wi,W2, 
iW2\^l| being odd is equivalent to 

1 = W2-WI = W2 - ((1, 1) + ivi) = ||ji'2||l + ^2 ■ H'l- 

Hence, one can check that when Wi, W2 have the same parity, 
there are either no edges or 2 edges between them. Moreover, 
when their parities are different, there is exactly one edge 
between the two vertices. 

When m = 1, the graph Di has only one vertex and the 
only nonempty induced subgraph is itself, d^ = djj^ = = 
When m — 2, the graph D2 has three vertices and one 
can check that the induced subgraph with maximum density 
contains lOi = (1,0), ^2 = (0/1)' ™d the density is 1/2 = 
(m — l)/m. 

For m > 2, assume to the contrary that there exists a 
subgraph of Dm with density higher than ^^7^. Let H be 
the smallest subgraph of Dm (with respect to the number of 
vertices) among the subgraphs of Dm with maximal density. 
Hence for any subset of vertices S C V{H), we have 
ds < dn- Therefore from Lemma [12] we conclude that for 
any partition S, T of V{H), dn < dsj- If H contains both 
even and odd vectors, denote by S and T the set of even and 
odd vectors of H respectively. Since between any even and 



any odd vertex there is exactly one directed edge we get that 

dn < dsj 



J. However 



m 



1 

- < 

2 m 



<d 



and we get a contradiction. Thus H contains only odd vectors 
or even vectors. 

Let V{H) = {vi, ...,vif}. If this set of vectors is indepen- 
dent then k < m and the outgoing degree for each vertex 
V; is at most k — 1 hence dfj = < ^^r^ < 

and we get a contradiction. Hence assume that the dimension 
of the subspace spanned by these vectors in F™ is J < A: 
where Vi,V2,...Vi are basis for it. Define S = {vi, ...Vi},T = 
{vi+l, ...,vif}. The following two cases show that the density 
can not be higher than 

H contains only odd vectors: Let u E T. Since u e 
span{S} there is at least one v E S such that m ■ c 7^ and 
thus {u,v), {v,u) i E{H), therefore the number of directed 
edges between u and S is at most 2{l — 1) for all u E T, 
which means 



dn < d.q T < 



S,T 



2{1 - 1)\T\ 



2\S\\T\ 



< 



m 



m 



and we get a contradiction. 

H contains only even vectors: Since the z;,'s are 
even the dimension of span{S} is at most m — 1 (since 
for example (1,0, ...,0) i spanjS}) thus / < m — 1. 
Let H* be the induced subgraph of D^+i with vertices 
V{H*) = {{l,Vi)\vi E V{H))}. It is easy to see 
that all the vectors of H* are odd, ((1, i;,), (l,z;^)) E 
E{H*) if and only if {vi,Vj) E E{H), and the dimension of 
span{y(H*)} is at most I + 1 < m. Having already proven 
the case for odd vectors, we conclude that 



dn = du* < 

< 
< 



dim(span{V(H*)}) - 

dim(span{y(H*)}) 

/ + 1-1 

/ + 1 
m — 1 



m 



and we get a contradiction. ■ 

V. Finite Field Size of a Code 

In this section, we address the problem of finding proper 
coefficients in the parities in order to make the code MDS. We 
have already shown that if a code is over some large enough 
finite field F, it can be made MDS (Theorem|5]l. And we have 
shown that the optimal code in Theorem [1] needs only field 
of size 3. In the following, we will discuss in more details 
on the field size required to make two kinds of codes MDS: 
(1) duplication of the optimal code in Corollary (TO] and (2) 
a modification of the code in Example |2l Note that both the 
codes have asymptotic optimal ratio. 

Consider the duplication of the optimal code (the code in 
Corollary [Toll. For the s-duplication code C' in Theorem [TOl 
denote the coefficients for the element in row and column 



y(') by ajy and ^jy, < f < s - 1. Let F, be a field of size 



.(f) 
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q, and suppose its elements are {0, a^, a^, . . . ,a'^ ^} for some 
primitive element a. 

Constructions For the s-duplication code C in Theorem [TOl 
over Fq, assign aj^ = 1 for all i, j, t. For odd q,lets < q — 1 
and assign for aJJ f G [0, s — 1] 



ifuj ■ i 



where Uj = Y1\^q ^l- Por even q (power of 2), let s < q — 2 and 
assign for alltE [0, s — 1] 



^t+l 



ifUj ■ i 



Notice that the coefficients in each duplication has the same 
pattern as Construction [2] except that values 1 and 2 are 
replaced by and a*'^^ if q is odd (or a*^^ and a^*^^ if 
q is even). 

Theorem 14 Construction^ is an (s(m + 1) + 2,s(m + 1)) 
MDS code. 

Proof: For the two elements in columns and 
row r, ti 7^ t2, we can see that they are in the same row 
set and the same zigzag set. The corresponding two equations 
from the two parities are linearly independent iff 

^ f^f, (8) 

which is satisfied by the construction. 

For the four elements in columns f (*i),y(*2) ^^d rows r, r' = 
r + £/ + Cj, < ti,t2 < s — 1, < i < i < m, the code is 
MDS if 

aih)f,{h) , fl(f2)o(f2) 
Pr,i Pr',i r= Pr,j Pr',j 



by (O. By the remark after Theorem |3] we know that fi^^^^^ ^ 



/5(y, and f>^if 



for some x. When q is odd, 



a 



2ti+l 



2x 



for any x and fj. When q is even. 



2x 



for any ti and 1 < x < q — 2 (mod q — 1). And by construc- 
tion, X = t2 + l or X = —t2 — 1 for < f2 < s — 1 < — 3, 
sol<x<q — 2 (mod q — 1). Hence, the construction is 
MDS. ■ 
Remark: For two identical permutations f^^^^ = ff^\ (El 
is necessary and sufficient condition for an MDS code. 

Theorem 15 For an MDS s-duplication code, we need a finite 
field Wq of size q > s + 1. Therefore, Theorem fT4l is optimal 
for odd q. 

Proof: Consider the two information elements in row i 
and columns /'i^/*^', which are in the same row and zigzag 
sets, for ti ^t2e [0,s - 1]. The code is MDS only if 

(fi) (ti) 
^h] Pi,i 



has full rank. All the coefficients are nonzero (consider 
erasing a parity column and a systematic column). Thus, 

{c^ffr^^ff + {o<-tfy^^^!j\ and (a[-V^|6SJ are distinct 
nonzero elements in F^, for t G [0, s — 1]. So > s + 1. ■ 

For instance, the coefficients in Figure |4] are assigned as 
Construction |3] and F3 is used. One can check that any two 
column erasures can be rebuilt in this code. 

Consider for example an s-duplication of the code in Theo- 
rem[T]with m = 10, the array is of size 1024 x (lis + 2). For 
s = 2 and s = 6, the ratio is 0.522 and 0.537 by Corollarv [TOl 
the code length is 24 and 68, and the field size needed can be 
4 and 8 by Theorem [T4l respectively. Both of these two sets 
of parameters are suitable for practical applications. 

As noted before the optimal construction yields a ratio of 
1/2 + 1/ m by using duplication of the code in Theorem [T] 
However the field size is a linear function of the number of 
duplications of the code. Is it possible to extend the number 
of columns in the code while using a constant field size? We 
know how to get 0{nr') columns by using 0{m^) duplications 
of the optimal code, however, the field size is 0{m^). The 
following code construction has roughly the same parameters: 
0{m'^) columns and an ratio of j + 0(-^), however it requires 
only a constant field size of 9. Actually this construction is a 
modification of Example |2] 



Construction 4 Let 3\m, and consider the following set of 
vectors S C ¥'2': for each vector v = {vi, . . . ,v„i) E S, 
\\v\\i = 3 and Vi^,Vi^,Vi^ = 1 for some ii € [l,m/3],i2 E 
[m/3 + l,2)«/3],/3 E [2m/3 + l,m]. For simplicity, we 
write V = {ii,!2/?3}- Construct the {k + 2,k) code as in 
Construction [T] using the set of vectors S, hence the number 
of systematic columns is k = \S\ = {'-j)^ = 'jy. For any 
i E [777; /3 + 1, (y + 1 ) m /3] and some j = 0,1,2, define a row 
vector Mi = Yl\=jm/3+i £/■ Then define am x3 matrix 



Mv 



Mj Mj 

'1 '2 



Mj 

'3 



for V = Z2, 73}. Let a be a primitive element of F9. Assign 
the row coefficients as 1 and the zigzag coefficient for row r, 
column V as a\ where t = rMv E F^ (in its binary expansion). 

For example, let m = 6, and v = {1,4,6} = 
(1,0,0,1,0,1) E S. The corresponding matrix is 



Mv 



1 
110 
1 1 



For row r = 26 = (0, 1, 1, 0, 1, 0), we have 
t = rMv = (0,1,1) = 3, 
and the zigzag coefficient is a^. 

Theorem 16 Construction |4] is a (k + 2, k) MDS code with 
array size 2"' x (k + 2) and k = m^/27. Moreover, the 
rebuilding ratio is\ + ^ for large m. 

Proof: For each vector v E S, there are 3(m/3 — Vf- 
vectors u E S such that they have one 1 in the same location 
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as V, i.e. \v\u\ = 2. Hence by Theorem [T] and Lemma |2] for 
large m the ratio is 

^2 



3((f)-l)^ 
2( 



27 



9_ 
2m' 



Next we show that the MDS property of the code 
holds. Consider columns u,v for some u = {h,i2/h} 7^ 
V = [h'h'h) and fi,7i e [l,ni/3\,i2,h ^ [mfi + 
1,2m / 3], i^,]^ e [2wz/3 + l,m]. Consider rows r and r' = 
r + u + V. The condition for the MDS property from (O 
becomes 



^rMji+r'Mj, mod 8 _^ ^rMj,+r'Mj, mod 8 



(9) 



where each vector of length 3 is viewed as an integer in [0, 7] 
and the addition is usual addition mod 8. Since v ^ u, let 
/ G [1, 3] be the largest index such that Z; 7^ W.l.o.g. assume 
that f; < hence by the remark after Theorem [3] 



rMl 7^ r'Ml 



(10) 



and 



rM 



r'M]. 



(11) 



Note that for all f, / < f < 3, U = jt, then since r'Mj^ 



rM 



T 



rMj, we have 



r'Ml 



rM 



r'Ml- 



(12) 



H If ]t 

It is easy to infer from ([T0ll,([n),<[l2ll that the /-th bit in the 
binary expansions of rM„ + r'Mj^ mod 8 and rMj + r'Mj 
mod 8 don't equal. Hence (|9]l is satisfied, and the result 
follows. ■ 

Notice that if we do mod 15 in (|9]l instead of mod 8, 
the proof still follows because 15 is greater than the largest 
possible sum in the equation. Therefore, a field of size 16 is 
also sufficient to construct an MDS code, and it is easier to 
implement in a storage system. 

Construction |4] can be easily generalized to any constant 
c such that it contains 0{m'^) columns and it uses the field 
of size at least 2'^ + 1. For simplicity assume that c\m, and 
simply construct the code using the set of vectors {v} C 
such that \\v\\i = c, and for any G [0, c — 1], there is unique 
ij e [jm/c + 1, (;' + l)m/c] and i^;, = 1. Moreover, the finite 
field of size 2*^+^ is also sufficient to make it an MDS code. 
When c is odd the code has ratio of 

^2 



C 

2m 



for large m. 



VI. Decoding of the Codes 



In this section, we will discuss decoding algorithms of the 
proposed codes in case of column erasures as well as a column 
error. The algorithms work for both Construction [T] and its 
duplication code. 

Let C be a (A: + 2, k) MDS array code defined by Con- 
struction [T] (and possibly duplication). The code has array size 
2'" X {k + 2). Let the zigzag permutations be fj, j e [0, A: — 1], 



which are not necessarily distinct. Let the information ele- 
ments be a, and the row and zigzag parity elements be r, 
and Zj, respectively, for / e [0,2"' - e [0,k-l]. W.l.o.g. 
assume the row coefficients are a, ^ = 1 for all And let 
the zigzag coefficients be fij j in some finite field F. 

The following is a summary of the erasure decoding algo- 
rithms mentioned in the previous sections. 

Algorithm 1 (Erasure Decoding) 
One erasure. 

1 ) One parity node is erased. Rebuild the row parity by 



k-l 
j=0 



and the zigzag parity by 



k-l 

]=0 ' ' 



(13) 



(14) 



2) One information node j is erased. Rebuild the elements in 

rows Xj (see Construction\li) by rows, and those in rows Xj by 

zigzags. 

Two erasures. 

1) Two parity nodes are erased. Rebuild by ( fT3] l and ( fT4] l. 

2) One parity node and one information node is erased. If the 
row parity node is erased, rebuild by zigzags; otherwise rebuild 
by rows. 

3) Two information nodes ji and y'2 are erased. 
- iff-^ = /.^ , for any i e [0, 2"' - 1], compute 



'ff%{^)fff%{iU 



(15) 



Solve Uii , Uii^ from the equations 



" 1 1 ■ 




' "',h ' 




■ Xi ' 













Else, for any i e [0,2™ - 1], set i' 



i + fn{0) + 



fj^ (0), and compute Xi, Xii,yi, y^t according to ( fTsl l. Then solve 
^i,i\'^i,j2' '^i',h'^i',j2 equations 



' 1 


1 





" 
















1 


1 








Xji 


















Vi 


















. Vi' 



In case of a column error, we first compute the syndrome, 
then locate the error position, and at last correct the error. 
Let Xq,Xi, . . . ,Xp_i G F. Denote /^^ (xq/ ^1/ •■•/ ^p-i) = 
(Xyr-i(o),X|-i(i), . . .,X|-i(^_j)) for a permutation / on 
[0, p — 1] . The detailed algorithm is as follows. 

Algorithm 2 (Error Decoding) 
Compute for alii e [0,2'" - 1]; 

k-l 

/=0 

k-l 

/=0 ' ' 
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Let the syndrome be Sq = (so,0/Si,0/ • ■ ■/S2"'-i,o) '^^^ ^1 = 

(S0,l/Sl,l, . . •,S2'"-l,l)- 

-IfSo =0 and Si = 0, there is no error 

- Else if one ofSo, Si is 0, there is an error in the parity. Correct 
it by O or 

- Else, find the error location. For j = tok — 1: 

Compute for all i E [0,2'" - 1], = |6,- yS,',o- 
Let Xj = (xo,;, • • • , ^2™-!,/) and Yj = ff^i^j)- 
IfYj = Si, subtract Sq from column]. Stop. 
If no such j is found, there are more than one error. 

If there is only one error, the above algorithm is guaranteed 
to find the error location and correct it, since the code is MDS, 
as the following theorem states. 

Theorem 17 Algorithm^can correct one column error 

Proof: Notice that each zigzag permutation fj is the 
inverse of itself by Construction [T] or/y = /y^^. Suppose there 

is error in column and the error is £ = (eo/^l/ • ■ - /^i'"-!)- 
So the received column / is the sum of the original information 
and E. Thus the syndromes are s, o = and 

= ^/y{0//y(0- 

For column t, t E [0,k — 1], we have x,- 1 = /5,- fS/ g = f^ij^i- 
Write Yf = = (i/o,t/ ■ • ■/y2"'-l,t) and then 

Vht = ^Mi),t = ?>ft{i),t^f,{i)- 

We will show the algorithm finds Yf = Si iff t = j, and 
therefore subtracting Sq = E from column / will correct the 
error When t = j, yi t = S/ i, for all i G [0,2'" — 1], so 
Yj = Si- Now suppose there is t j such that Yf = Si. Since 
the error E is nonzero, there exists i such that efj[i) 7^ 0. 
Consider the indices / and i' = ftfj{i)- yi,t = s,;i yields 



(16) 



Case 1: When ft = fj, set r = ft{i) = fj{i), then (fT6] i 
becomes ^r,t£r = f^r,fr with e,. ^ 0. Hence j6,-^t = f?>y j which 
contradicts (O. 

Case 2: When ft 7^ fj, since ft,fj are commutative and are 
inverse of themselves, ft{i') = ffftfj{i) = //(O and = 
fjftfj{i) = ft{i)- Therefore y,/ f = s,/ 1 yields 

The two equations ( fTSI l ( [TtI i have nonzero solution 

(^/;(0'^/((0) iff 

which contradicts (O with r = ft{i),^' = fj{i)- Hence the 
algorithm finds the unique erroneous column. ■ 
If the computations are done in parallel for all / G [0, 2'" — 
1], then Algorithm |2] can be done in time 0{k). Moreover, 
since the permutations /,'s only change one bit of a number 
in [0,2"' — 1] in the optimal code in Theorem [T] the algorithm 
can be easily implemented. 



VII. Generalization of the Code Construction 

In this section we generalize Construction [T] to arbitrary 
number of parity nodes. Let n — k = r be the number of 
parity nodes. We will construct an {n,k) MDS array code, i.e., 
it can recover from up to r node erasures for arbitrary integers 
n,k. We will show this code has optimal rebuilding ratio of 
1/r when a systematic node is erased. We assume that each 
systematic nodes stores ^ of the information and corresponds 
to columns [0, A: — 1]. The f-th parity node is stored in column 
k + i,Q<i<r — 1, and is associated with zigzag sets {Z^j : 
i G [0, p — 1] }, where p is the number of rows in the array. 

Constructions Let the information array be A = {cii^j) with 
size r"' x k for some integers k, m. Let T = {vq, C/t-l} ^ 
be a subset of vectors of size k, where for each v = 

{vi,...,v„,) e T, 



gcd{vi,..., v,n,r) = 1, 



(18) 



where gcd is the greatest common divisor For any I, < I < 
r — 1, and v E T we define the permutation fl : [0, r'" — 1] ^ 
[0, r'" — 1] by fl{x) = x + lv, where by abuse of notation we 
use X G [0, r'" — 1] both to represent the integer and its r-ary 
representation, and all the calculations are done over Zy.For 
example, form = 2,r = 3,x = 4,Z = 2,v = (0,1), 

/fo,i) (4) =4 + 2(0,1) = (1,1) + (0,2) = (1,0) = 3. 

One can check that the permutation f^^^ in a vector notation is 
[2, 0, 1, 5, 3, 4, 8, 6, 7] . For simplicity denote the permutation fj,. 

as fj for Vj e T. For t E [0, r'" — 1] , we define the zigzag set Zl 
in parity node I as the elements a,j such that their coordinates 
satisfy fj{i) = t. In a rebuilding of systematic node i the 

elements in rows X' = {x E [0, r'" — 1] : x ■ Vj = r — 1} 
are rebuilt by parity node I, I E [0, r — 1]. From ( fTsl l we get 
that for any i and I, |X'[ = r^~^. 

Note that similar to Theorem |5] using a large enough field, 
the parity nodes described above form an {n,k) MDS array 
code under appropriate selection of coefficients in the linear 
combinations of the zigzags. 

Consider the rebuilding of systematic node i E [0, A: — 1]. In 
a systematic column 7^ / we need to access all the elements 
that are contained in the sets that belong to the rebuilding 
set of column i. Namely, in column / we need to access the 
elements in rows 

( fT9] l follows since the zigzags Z' for any t E //(X') are used 
to rebuild the elements of column i in rows X'. Moreover 
the element in column / and zigzag Zl is flr-^^^^ ■■ The 

following lemma will help us to calculate the size of ( fT9l l, 
and in particular calculating the ratio of codes constructed by 
Construction |5] 

Lemma 18 For any v = [pi, ...v„i), u E ZJ" and I E [0, r — 1] 
such that gcd(j7i, Vm, r) = 1, define Cy^u = v ■ {v — 11) — 1. 
Then 

" (;■-;>,,, =0 

0, o. w. 



\fu'fv{K)nfu'fUxl)\ 
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In particular for j = we get 

\fu'fUxl)nx',\ = 



0, o. w. 



Proof: Consider the group (Z™, +). Note that X° = {x : 
X ■ 2? = 0} is a subgroup of ZJ." and X\, = {x : x ■ v = r — i} 
is its coset. Therefore, X[, = + a[„ X{, = X^ + a^, for some 
fli e X;,fl(, e Xi. Hence /,77i(Xj,) = X^ + fl^ + i{v - u) 
and /„ ^fl,{xj-,) = Xl + a{ + i{v - u) are cosets of X°. So 
they are either identical or disjoint. Moreover they are identical 
if and only if 

a^^-al + {i-i){v-u) eXO, 

i.e., (fly — fli + (' ~ ~ m)) • ^ = 0. But by definition of 
Xj, and Xy, ay-v = —i, a{, ■ v = —j, so (z — ;') ■ c-o^u = and 
the result follows. ■ 
The following theorem gives the ratio for any code of 
Construction |5] 

Theorem 19 The ratio for the code constructed by Construc- 
tion^and set of vectors T is 

1 



+ T 



|T|(|T|-l + r) 

which also equal to 

1 IlveTllueT,u^v \Fu,v{X^) n X§| 
r \T\{\T\ -l + r)r'« ' 

Here we define the function Fu^:,{t) = fu^fvi^) ^^r f G XJ,. 

Proof: By ( fT9] l and noticing that we access elements 
in each parity node, the ratio is 

|r|(|r| -l + r)r'« " ^ ' 

From Lemma [18] and noticing that |{z : icv,u = 
mod r}\ = gcd{r,c-o^it), we get 

r'""^ X r/ gcd{r,Cv,u)- 
And the first part follows. For the second part, 

r|(|r| - 1 + r)r'« 

|xS| + |u-V-'/^(xj,)\xO| + |r|r'" 

|r|(|r| -l + r)r"' 

_ 1 EcgrEu^cgT I u-^i /„ '/o(x^) n x^ 



r |r|(|r| - 1 + r)r'« 

1 HveT'LiieTAi^v \Pu,v{Xy) nX^ 
r |r|(|T| - 1 + r)r'« 



(21) 



The proof is completed. ■ 
Notice that X§ represents elements not accessed for par- 
ity (row parity), and Fu^y{X^) are elements accessed 
for parity 1,2, . . .,r - 1. Therefore Fu,v{X°) H X° are the 
elements accessed excluding those for the row parity. In 
order to get a low rebuilding ratio, we need to mini- 
mize the second term in i2l[ . We say that a family of 



permutation set {{/qYiZq' {fl-iViZo} together with sets 
{{■^o}[=0' "t-^fc-l J"/=0 } ^ family of orthogonal permu- 
tations if for any i,] G [0, A: — 1] the set {XJ}^^q is a equally 
sized partition of [0, r'" — 1] and 



|f;.-(x°)nxo 

rm-\(r — 1 



One can check that for r = 2 the definition coincides with the 
previous definition of orthogonal permutations for two parities. 
It can be shown that the above definition is equivalent to that 
for any < / 7^ ; < - 1, < / < r - 1, 



f]{X^) = fl{X\) 



(22) 



For a set of orthogonal permutations, rebuilding ratio is 1 /r 
by (ISTT i. which is optimal according to ([T]), 

Now we are ready to construct a code with optimal rebuild- 
ing ratio and r parities. 

Theorem 20 The set {{/oj^Zg, {fl„]\Z^] together with set 
{{^o}/=0'-' ■t^m}/=o} constructed by the vectors {ej^^g 
and Construction\5\ where Xq is modified to be Xq = {x G 
: X ■ (1,1,. ..,1) = /} for any I G [0,r-l] is a fam- 
ily of orthogonal permutations. Moreover the corresponding 
(ffx + 1 + r, m + 1) code has optimal ratio of j. 

Proof: For 1 < / 7^ / < m, Ci^j = e,- ■ (e, — Cj) — \ =0, 
hence by Lemma [TS] for any I G [0, r — 1] 

fr^fl{x\)r^x^ = xl 

and (|22] | is satisfied. For 1 <i < m, and all < / < r — 1, 

= fh{v:v, = -l}) = {v + ler.v, = -l} 
= {v:Vi = 0} = X^ 

Therefore, /o"'//(Xj) H Xf = Xf, and ^ is satisfied. 
Similarly, 



f-\{v:v-{l 1) = /}) 

{i;-/e,:i;-(l,...,l) = /} 
{i;:i;.(l,...,l) = 0}=xO. 



Hence again ( l22l i is satisfied and this is a family of orthogonal 
permutations, and the result follows. ■ 

Surprisingly, one can infer from the above theorem that 
changing the number of parities from 2 to 3 adds only one 
node to the system, but reduces the ratio from 1/2 to 1/3 in 
the rebuilding of any systematic column. 

The example in Figure |5] shows a code with 3 systematic 
nodes and 3 parity nodes constructed by Theorem |20] with 
m = 2. The code has an optimal ratio of 1/3. For instance, if 
column Ci is erased, accessing rows {0, 1,2} in the remaining 
nodes will be sufficient for rebuilding. 

Similar to the 2 parity case, the following theorem shows 
that Theorem |20] achieves the optimal number of columns. In 
other words, the number of rows has to be exponential in the 
number of columns in any systematic MDS code with optimal 
ratio, optimal update, and r parities. This follows since any 
such optimal code is constructed from a family of orthogonal 
permutations. 
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Figures. A (6,3) MDS array code with optimal ratio 1/3. Ttie first parity C3 corresponds to tlie row sums, and tlie corresponding identity permutations 
are omitted. Tlie second and third parity C4, C5 are generated by the permutations ffrff respectively, i — 0,1,2. The elements are from F7, where 3 is a 
primitive element. 



Theoremll Let {{/olJ'Zo, {//'_i}';Io} ^ ^^™^>' 
thogonal permutations over the integers [0, r"' — 1] together 
with the sets {{X[,}[Zq,...,{X[_j};Zq}, thenk <m + l. 

Proof: We prove it by induction on m. When m = 
0, it is trivial that k < 1. Now suppose we have a 
family of orthogonal permutations {{/qI/Zq, •-, {//_^}JZq} 
over [0, r'" — 1], and we will show k < m + 1. Recall 
that orthogonality is equivalent ( |22] |. Notice that for any 
permutations g, ho, {{hifog}\zl, {hifl_^g}\zl}} 
are still a family of orthogonal permutations with sets 
{{g-\A)} This is because 

= hif,gig~Hxl)). 

Therefore, w.l.o.g. we can assume Xl = [lr'"-\ (/ + 
l)r"'^^ — 1], and /q is the identity permutation, for < / < 
r-1. 

Let 1 < 7 < - 1, / e [0, r - 1] and define 

B = /;'(xOnxg), 
c = //(xjnxg). 

Therefore B, C are subsets of A, and their compliments in A 
are 

A\B = f'.{x<^nxl), 
A\c = //(x|nxO). 

From (I22I) for any j ^ 0, 

fj{4)=M)=x'o (23) 

hence, 

B,CQXi (24) 



Similarly, for any ; 7^ 0, /-(Xg) = f.{X^) = X^, hence 

A\B,A\CcWq. (25) 
From (|24ll,(|25]) we conclude that B = C = ACiX'q, i.e., 

/;'(xOnxg)=/;'(x,'nxg). (26) 

For each I e [0,r - 1],] £ [l,k- 1] define f'j{x) = f.{x) - 
Ir"'-^ and Xj = Xj fl Xg then, 

f]{[Q,r-'-^-l])=f.[Xl)-lr"^-^ 

= X[- Ir'"^^ (27) 
= [0,r'"-i -1], 

where (|27] | follows from ( l23T l. Moreover, since is bijective 
we conclude that fj is a permutation on [0, r'"~^ ~ !]• 

i:'(x|)=/;'(x|nxg)-/r-i 

= /^?(X°nX°) -/r'"-i (28) 

where (|28]) follows from (|26] |. Since {X'}J^q is a partition 
of [0,r'" - 1], then {X?}^-,^ is also a partition of Xg = 
[0,r'"-l-l]. Moreover, since fl{Xl) = fj{X°) for any 
I G [0, r — 1], and are bijections, we conclude 

\xl\ = \xf\ 

for all / e [0,r-l], i.e., {X?}, / e [0,r- 1], is a equally sized 
partition of [O,/-'""! - 1]. Therefore {{/j }[Zo\ {/Lil^} 
together with {X[_^}[~q} is a family of or- 

thogonal permutations over integers [0, r"'^^ ~ 1]> hence by 
induction k — 1 < m and the result follows. ■ 
After presenting the construction of a code with optimal 
ratio of 1/r, we move on to deal with the problem of assigning 
the proper coefficient in order to satisfy the MDS property. 
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This task turns out to be not easy when the number of parities 
r > 2. The next theorem gives a proper assignment for 
the code with r = 3 parities, constructed by the optimal 
construction given before. This assignment gives an upper 
bound on the required field size. 

Theorem 22 A field of size at most 2{m + 1) is sufficient to 
make the code constructed by Theoreml20l with r = 3 parities, 
a (m + 4, m + 1) MDS code. 

Proof: Let Fq be a field of size q > 2{m + l). For any / G 
[0, m] let A; = (flij) be the representation of the permutation 
f}^ by a permutation matrix with a slight modification and is 
defined as follows, 

a' andy-e, =0 

1 =f andy-e, ^0 

otherwise. 



where a is a primitive element of Wq. Let W be the matrix 
that create the parities nodes, defined as 



W 



Bl 



Bin 
g2 



Where B 



{A,y for / e [0,m] and; E [0,2]. It easy 
to see that indeed block row i E [0,2] in the block matrix 
m corresponds to parity i. We will show that this coefficient 
assignment satisfy the MDS property of the code. First we will 
show that under this assignment of coefficients the matrices 
A; commute , i.e. for any h h & [0,^], Ai^Ai^ 
For simplicity, write /^^ = = /2,A,j = A;^ 

{bi^j),3"' = p. For a vector x = {xq, Xp_i) and y = xAi^, 
its ;-th entry satisfies yj = for all / G [0, p — 1] 

And by similar calculation, z = x A;^ A;^ = yAi^ will satisfy 

Similarly, if w = xA] A] 



then 



/i(;)j^/2{/i(;)),/i(;)^/2{/i(;))- 



Notice that 



Moreover, 



""hdmMiy Similarly, Uf^^^j^^j = «/j(/,(;)),/,o-)- 



Hence, Zj = Wj for all / and 

xA/jA/2 = z = w = xAi^Ai_^ 

for all X e F^. Thus A^A^^ = Ai^A^. 

Next we show for any z. A? = a' J. For any vector x. Let 
y = xA^. Then 

However, /? {]) = j + Se,- = (since the addition is done over 
F™), and exactly one of / ■ e,v/i(/) ■ £i,ff{i) ■ equals to 0. 
Thus yj = a'Xj or xA? = a'x for any x. Hence A^ = a'l. 



The code is MDS if it can recover from loss of any 3 nodes. 
With this assignment of coefficients the code is MDS iff any 
block sub matrices of sizes 1 x 1,2 x 2,3 x 3 of the matrix 
M are invertible. The case of 1 x 1 sub matrix is trivial. Let 
< / < ; < A: < m we will see that the 3 x 3 matrix 



I 

A, 
A? 



I 



is invertible. By Theorem 1 in |[22l and the fact that all the 
blocks in the matrix commute we get that the determinant of 
this matrix equals to det(Aj- — Aj) ■ det(A;, — A,) ■ det(Aj — 
A,). Hence we need to show that for any / > det(A/ — 
Aj) 7^ 0, which is equivalent to det(A/Ari - /) ^ 0. Note 

that for any i. A? = a'l. Denote by A = AjAr^, hence 



-1^3 



A3 = {AiA- , 
^ det(A3 



AfArS 



J) 



a} JI I. Therefore 
det(A - /)det(A2 + A + 7). 



Therefore det(A - /) = det(A/Ari _ /) ^ q. 

For a submatrix of size 2 x 2, we need to check that for 



det( 



I 

^1 



I 

A? 



det(Ap det(A2Ar2 - 1) ^ Q. 



Note that A^ 
m < Hence 

^ dei{A^ 

I ) -- 



(A,Ari)6 = a2(i-/)/ ^ / since < z - ; < 



I) = det(A2 
det(A2Ar2 _ 



I) {A' 



I), 



I) ^ which concludes 



and det(A2 
the proof. 

For example, the coefficients of the parities in Figure |5] are 
assigned as the above proof. Since m = 2, the field of size 7 
is sufficient. The primitive element is chosen to be 3. One can 
check that when losing any three columns we can still rebuild 
them. 

VIII. Rebuilding Multiple Erasures 

In this section, we discuss the rebuilding of e erasures, 1 < 
e < r. We will first prove the lower bound for rebuilding ratio 
and repair bandwidth. Then we show a construction achieving 
the lower bound for systematic nodes. At last we generalize 
this construction and Construction |5] and propose a rebuilding 
algorithm using an arbitrary subgroup and its cosets. 

In this section, in order to simplify some of the results we 
will assume that r is a prime and the calculations are done 
over Fr. Note that all the result can be generalized with minor 
changes for an arbitrary integer r and the ring Z,-. 

A. Lower Bounds 

The next theorem shows that the rebuilding ratio for Con- 
struction |5] is at least e/r. 

Theorem 23 Let A be an array with r parity nodes constructed 
by Construction^ In an erasure of 1 < e < r systematic nodes, 
the rebuilding ratio is at least j. 
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Proof: In order to recover the information in the system- 
atic nodes we need to use at least er™ zigzag sets from the 
j-m+l ggjg (fjigj-g r parity nodes, r™ zigzag sets in each 
parity). By the pigeonhole principle there is at least one parity 
node, such that at least ef"^^ of its zigzag sets are used. Hence 
each remaining systematic node has to access its elements that 
are contained in these zigzag sets. Therefore each systematic 
node accesses at least er'"~^ of its information out of r™, 
which is a portion of ^. 

Since we use at least er'" zigzag sets, we use at least er™ 
elements in the r parity nodes, which is again a portion of p 
Hence the overall rebuilding ratio is at least ^. ■ 

In a general code (not necessary MDS, systematic, or 
optimal update), what is the amount of information needed 
to transmit in order to rebuild e nodes? Assume that in the 
system multiple nodes are erased, and we rebuild these nodes 
simultaneously from information in the remaining nodes. It 
should be noted that this model is a bit different from the 
distributed repair problem, where the recovery of each node 
is done separately. We follow the definitions and notations of 
II23I . An exact-repair reconstructing code satisfies the follow- 
ing two properties: (i)Reconstruction: any fc nodes can rebuild 
the total information. (ii)Exact repair: if e nodes are erased, 
they can be recovered exactly by transmitting information from 
the remaining nodes. 

Suppose the total amount of information is and the n 
nodes are [«] . For e erasures, 1 < e < r, denote by a, dg, 
the amount of information stored in each node, the number 
of nodes connected to the erased nodes, and the amount of 
information transmitted by each of the nodes, respectively. For 
subsets A, B C [n], W/i is the amount of information stored 
in nodes A, and is the amount of information transmitted 
from nodes A to nodes B in the rebuilding. 

The following results give lower bound of repair bandwidth 
for e erasures, and the proofs are based on ||231 . 

Lemma 24 Let B C [«] be a subset of nodes of size \e\, then 
for an arbitrary set of nodes A, \A\ < de such that B n A = 0, 

H{Wb\Wa) <mm{\B\a,{d,-\A\)^,}. 

Proof: If nodes B are erased, consider the case of con- 
necting to them nodes A and nodes C, |C| = — |A|. Then 
the exact repair condition requires 



= H{Wb 


cB cB\ 


= H{Wb 






> H{Wb 


sD- 




> H{Wb 


sD- 


id-\A\)fie 


> H{Wb 




-(d-|A|)/5, 



Moreover, it is clear that H(Wb|W^) < H{Wb) < \B\a and 
the result follows. ■ 

Theorem 25 Any reconstructing code with hie size A4 must 
satisfy for any 1 < e < r 

LlJ-i 

A4<sa+ (df — — s)j6e} 

1=0 



where s = k mod e, < s < e. Moreover for an MDS code, 

P*" - k{d~k+e)- 

Proof: The file can be reconstructed from any set of k 
nodes, hence 



= H{Wls])+ E H(W[,e+s+l,(m).+s]l>^[i.+s]) 
1=0 

LlJ-1 

< sa + ^ minjea, (de — fe — s)/3(;}. 
1=0 

In an MDS code a = hence in order to satisfy the 
inequality any summand of the form min{ea, {de — ie — s)fie} 
must be at least e^, which occurs if and only if (de — ( [|j — 
l)e - s)^e > Hence we get 



eM 



k{d-k + e)' 



And the proof is completed. ■ 
Therefore, the lower bound of the repair bandwidth for an 



MDS code is 



eM 



which is the same as the lower bound 



k{d-k+e_ 

of the rebuilding ratio in Theorem 



B. Rebuilding Algorithms 

Next we discuss how to rebuild in case of e erasures, 1 < 
e < r, for an MDS array code with optimal update. Theorem 
|25] gives the lower bound e/r on the rebuilding ratio for e 
erasures. Is this achievable? Let us first look at an example. 

Example 3 Consider the code in Figure^ with r = 3. When 
e = 2 and columns Cq, Ci are erased, we can access rows 
{0, 1, 3, 4, 6, 7} in column C2, C3, rows {1, 2, 4, 5, 7, 8} in col- 
umn C4, and rows {2, 0, 5, 3, 8, 6} in column C5. One can check 
that the accessed elements are sufficient to rebuild the two 
erased columns, and the ratio is 2/3 = e/r. It can be shown 
that similar rebuilding can be done for any two systematic 
node erasures. Therefore, in this example the lower bound is 
achievable. 

Consider an information array of size p x k and an [n, k) 
MDS code with r = n — k parity nodes. Each parity node 
I E [0, r — 1] is constructed from the set of permutations {/' } 
for i G [0, fc — 1]. Notice that in the general case the number 
of rows p in the array is not necessarily a power of r. We 
will assume columns [0,e — 1] are erased. In an erasure of e 
columns, ep elements need rebuilt, hence we need ep equations 
(zigzags) that contain these elements. In an optimal rebuilding, 
each parity node contributes ep/r equations by accessing the 
values of ep/r of its zigzag elements. Moreover, the union 
of the zigzag sets that create these zigzag elements, constitute 
an e/r portion of the elements in the surviving systematic 
nodes. In other words, assume that we access rows X from the 
surviving columns [e,k — 1], X C [0,p — l],then |X| = ep/r 
and 

f'j{X)=fjiX) 
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for any parity node I E [0, r — 1] and /,/ e [e,k — l\. Note 
that it is equivalent that for any parity node / G [0, r — 1] and 
surviving systematic node G [e,k— 1] 

fj{X)=fi{X). 

Let G' be the subgroup of the symmetric group Sp that is 
generated by the set of permutations I/jT' °/f It is easy 
to see that the previous condition is also equivalent to that for 
any parity I G [0, r — 1] the group G' stabilizes X, i.e., for 
any / e G',/(X) = X. 

Assuming there is a set X that satisfies this condition, we 
want to rebuild the ep elements from the chosen ep equations, 
i.e., the ep equations with the ep variables being solvable. A 
necessary condition is that each element in the erased column 
will appear at least once in the chosen zigzag sets (equations), 
parity I E [0, r — 1] accesses its zigzag elements fl{X), and 
these zigzag sets contain the elements in rows {fj)~^fl{X) 
of the erased column i E [0, e — 1]. Hence the condition is 
equivalent to that for any erased column i E [0,e — 1] 

ud(//)"Vi(x) = [o,p-i]. 

These two conditions are necessary for optimal rebuilding 
ratio. In addition, we need to make sure that the ep equations 
are linearly independent, which depends on the coefficients in 
the linear combinations that created the zigzag elements. We 
summarize: 

Sufficient and necessary conditions for optimal rebuilding 
ratio in e erasures: There exists a set X C [0, p — 1] of size 
|X| = ep/r, such that 

1) For any parity node I E [0, e — 1] the group G' stabilizes 
the set X, i.e., for any g E 



(29) 



where G' is generated by the set of permutations 
2) For any erased column / E [0, e — 1], 

uU(/i')-Vi(x) = [o,p-i]. 



(30) 



3) The ep equations (zigzag sets) defined by the set X are 
linearly independent. 

The previous discussion gave the condition for optimal 
rebuilding ratio in an MDS optimal update code with e 
erasures in general. Next will interpret these conditions in 
the special case where the number of rows p = r'", and 
the permutations are generated by T = {vq,v^, . . . ,V](^i} 
C F™ and Construction |5] i.e., fj{x) = x + Ivj for any 
X E [0, r™ — 1]. Note that in the case of r a prime 



G^ 



G 



r-1 



and in that case we simply denote the group as G. The 
following theorem gives a simple characterization for sets that 
satisfy condition 1. 

Theorem 26 Let X C FJ." and G defined above then G 
stabilizes X, if and only if X is a union of cosets of the subspace 



Proof: It is easy to check that any coset of Z is stabilized 
by G, hence if X is a union of cosets it is also a stabilized 
set. For the other direction let x,y E F™ be two vectors in the 
same coset of Z, it is enough to show that if x G X then also 
y E X. Since y — x E Z there exist ai, O-k-l-e G [0/ ~ 1] 
such that y — X = Ef^j ""^ a/(i^e+/ — Ve). Since /(X) = X for 
any / G G we get that /(x) G X for any x G X and / G G, 
hence 

y = X + y — X 

k-l-e 

= X+ ^ aLi{v,+i-Ve) 
i=l 

-«-k-\-e fIX-k-l-e r-O.^ rOii / \ 
Jk-1 ■••/c Je+\\^l' 



for '**"'"7^"*i'"''.../(r*V*|i G G. So y G X and the result 
follows. ■ 
Remark: For any set of vectors S and v,u E S, 

span{S — u} = span{S — m}. 

Here S — v = \v\ — v\v\ E S}. Hence, the subspace Z defined 
in the previous theorem does not depend on the choice of the 
vector Ve- By the previous theorem we interpret the necessary 
and sufficient conditions of an optimal code as follows: 

There exists a set X C F™ of size |X| = er'"^^, such that 

1) X is a union of cosets of 

Z = span{i;t.+i -Vg,..., f /t-i - v^}. 

2) For any erased column i E [0, e — 1], 



F" 



(32) 



3) The er™ equations (zigzag sets) defined by the set X are 

linearly independent. 
The following theorem gives a simple equivalent condition 
for conditions 1, 2. 

Theorem 27 There exists a set X C of size \X\ = er'"^^ 
such that conditions 1, 2 are satisfied if and only if 



(33) 



for any erased column i E [0, e — 1] . 

Proof: Assume conditions 1,2 are satisfied. If — v^eZ 
for some erased column i E [0, e — 1] then X = ^\Z^{X + 
l{vi — Ve)) = FJ.", which is a contradiction to X C F™. On 
the other hand. If ( [33] ) is true, then Vi — Vg can be viewed as 
a permutation that acts on the cosets of Z. The number of 
cosets of Z is r'"/|Z| and this permutation (when it is written 
in cycle notation) contains f"~^ /\Z\ cycles, each with length 
r. For each / G [0, e — 1] choose r'"~^/|Z| cosets of Z, one 
from each cycle of the permutation Vi — Vg. In total er'"~^/|Z| 
cosets are chosen for the e erased nodes. Let X be the union 
of the cosets that were chosen. It is easy to see that X satisfies 
condition 2. If |X| < er'"^^ (Since there might be cosets that 
were chosen more than once) add arbitrary (er™"^ — | X | ) / 1 Z | 
other cosets of Z, and also condition 1 is satisfied. ■ 
In general, if ( |33] | is not satisfied, the code does not have 
an optimal rebuilding ratio. However we can define 



spanju, 



e+l 



■,Vk-l-'"e}- 



(31) 



Z = span{z;,' - Uejiej, 



(34) 
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where we assume w.l.o.g. e £ J and I C [e, — 1] is a maximal 
subset of surviving nodes that satisfies for any erased node 
/ G [0, e — ^rVj — Veil.. Hence from now on we assume 
that Z is defined by a subset of surviving nodes /. This set 
of surviving nodes will have an optimal rebuilding ratio (see 
Corollary [30l l. i.e., in the rebuilding of columns [0, e — 1], 
columns I will access a portion of e/r of their elements. 
The following theorem gives a sufficient condition for the er'" 
equations defined by the set X to be solvable linear equations. 

Theorem 28 Suppose that there exists a subspace Xq that 
contains Z such that for any erased node i G [0, e — 1] 

Xo © span{z;,' - i^^} = Fj", (35) 

then the set X defined as an union of some e cosets of Xq 
satisfies conditions 1, 2 and 3 over a field large enough. 

Proof: Condition 1 is trivial. Note that by ( |35] ), l{vj — 
Ve) i Xq for any I E [l,r — 1] and i E [0, e — 1], hence 
{Xq + l{vi — ^^e)}/e[o,r-l] the set of cosets of Xq. Let Xj = 
Xq + j{vi — Ve) be a coset of Xq for some / G [0, e — 1] and 
suppose Xj C X. Now let us check condition 2: 

U[zi(X + l{vi -ve))2 U'lZliXj + l{v, - Ve)) 

= U[=o(Xo + - Ve) + l{Vi - Ve)) 
= U\zliXQ + {i + l)iv,-Ve)) 

= U\zl{Xo + t{v^-Ve)) (36) 
= F™. (37) 

(l36l l holds since j + I is computed mod r. So condition 2 is 
satisfied. Next we prove condition 3. There are er'" unknowns 
and er"' equations. Writing the equations in a matrix form we 
get AY = b, where A is an er'" x er'" matrix. Y, b are vectors 
of length er™, and Y = (yj, ...,1/^,.™)"'^ is the unknown vector. 
The matrix A = (cii^j) is defined as fl/ y = x, j if the unknown 
yj appears in the i-th equation, otherwise fl/y = 0. Hence 
we can solve the equations if and only if there is assignment 
for the indetermediates {x/y} in the matrix A such that 
det{A) 7^ 0. By ( |37] |. accessing rows corresponding to any 
coset Xj will give us equations where each unknown appears 
exactly once. Since X is a union of e cosets, each unknown 
appears e times in the equations. Thus each column in A 
contains e inde terminates. Moreover, each equation contains 
one unknown from each erased node, thus any row in A 
contains e inde terminates. Then by Hall's Marriage Theorem 
ll24l we conclude that there exists a permutation / on the 
integers [1, er'"] such that 

er"' 

Hence the polynomial det{A) when viewed as a symbolic 
polynomial, is not the zero polynomial, i.e., 

er"' 

det(A)= J] sgn(/)n «,-,/(/) 7^0. 

feS„m 1=1 

By Theorem |33] we conclude that there is an assignment from 
a field large enough for the indeterminates such that det( A) 7^ 



0, and the equations are solvable. Note that this proof is for a 
specific set of erased nodes. However if ( [35] l is satisfied for any 
set of e erasures, multiplication of all the nonzero polynomials 
det(A) derived for any set of erased nodes is again a nonzero 
polynomial and by the same argument there is an assignment 
over a field large enough such that any of the matrices A is 
invertible, and the result follows. ■ 
In order to use Theorem |28] we need to find a subspace Xg 
as in ( [35] l. The following theorem shows that such a subspace 
always exists, moreover it gives an explicit construction of it. 

Theorem 29 Suppose 1 < e < r erasures occur Let Z be 
defined by ( l34b and Vi — VeiZ for any erased node i G [0, e — 
1] . Then there exists u ^ Z such that for any G [0, e — 1] , 

u ■ {Vi - Ve) + 0. (38) 

Moreover the orthogonal subspace Xq = (m)^ satisfies ([35]). 

Proof: First we will show that such vector u exists. Let 
Ul,...Ui be a basis for (Z)^ the orthogonal subspace of Z. 
Any vector 11 in {Z)-^ can be written as u = Yl'j=i^jUj for 
some Xy's. We claim that for any i G [0, e — 1] there exists 
/ such that Uj ■ [v; — Vg) 7^ 0. Because otherwise, (Z)^ = 
span{Mi, . . . , Mf } ± Vi — Ve, which means c,- — Ve E Z and 
reaches a contradiction. Thus the number of solutions for the 
linear equation 

t 

XjUj ■ {Vi -Ve) =0 

/=1 

is r^^^, which equals the number of ti such that u ■ (u, — Ve) = 
0. Hence by the union bound there are at most er'^^ vectors 
u in (Z)-'- such that u ■ (u, — Ve) = for some erased node 
/ G [0, e — 1]. Since |(Z)-'-| = r' > er*~^ there exists u in 
(Z)^ such that for any erased node f G [0, e — 1], 

u ■ {Vj — Ve) 7^ 0. 

Define Xq = {u)^, and note that for any erased node i G 
[0,e — l],Vi — Vei Xq, since u ■ (u, — Vg) 7^ and Xq is the 
orthogonal subspace of u. Moreover, since Xq is a hyperplane 
we conclude that 

Xq © spanju/ - Ve} = F™, 

and the result follows. ■ 
Theorems |28] and |29] give us an algorithm to rebuild 
multiple erasures: 

1) Find Z by (El satisfying 

2) Find u ± Z satisfying ( |38] |. Define Xq = (m)^ and X 
as a union of e cosets of Xq. 

3) Access rows /j(X) in parity I G [0, r — 1] and all the 
corresponding information elements. 

We know that under a proper selection of coefficients the 
rebuilding is possible. 

In the following we give two examples of rebuilding using 
this algorithm. The first example shows an optimal rebuilding 
for any set of e node erasures. As mentioned above, the optimal 
rebuilding is achieved since ( [33l l is satisfied, i.e., / = [e,k — l]. 
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Example 4 Let T = {vq, Vi,..., Vm} be a set of vectors that 
contains an orthonormal basis of F"' together with the zero 
vector. Suppose columns [0, e — 1] are erased. Note that in that 
case I = [e, m] and Z is defined as in ( |34] |. Define 

m 

w = E ^i' 

and Xq = (w)^- When m = r and e = r — 1, modify u to be 

m 

i=l 

It is easy to check that u ± Z and for any erased column 

1 G [0,e — l],u ■ {vi — Ve) = —1. Therefore by Theorems 
[28] and [29] a set X defined as a union of an arbitrary e cosets 
of Xq satisfies conditions 1, 2 and 3, and optimal rebuilding is 
achieved. 

In the example of Figure [5] we know that the vectors gen- 
erating the permutations are the standard basis (and thus are 
orthonormal basis) and the zero vector When columns Cq, C\ 
are erased, u = 62 and Xq = (m)^ = spanjei} = {0,3,6}. 
Take X as the union of Xq and its coset {1,4,7}, which is the 
same as Example [3] One can check that each erased element 
appears exactly 3 times in the equations and the equations are 
solvable in F7. Similarly, the equations are solvable for other 

2 systematic erasures. 

Before we proceed to the next example, we give an upper 
bound for the rebuilding ratio using Theorem [28] and a set of 
nodes I. 

Corollary 30 Theorem\2E\requires rebuilding ratio at most 

e {r — e){k— \I\ — e) 
r r{k + r — e) 

Proof: By Theorem [28] the fraction of accessed elements 
in columns I and the parity columns is e/r of each column. 
Moreover, the accessed elements in the rest columns are at 
most an entire column. Therefore, the ratio is at most 

f(|/|+r) + (fc- |Z| -e) _ e {r - e){k - \I\ - e) 
k + r — e r r{k + r — e) 

and the result follows. ■ 
Note that as expected when \I\ = k — e the rebuilding ratio 
is optimal, i.e. e/r. In the following example the code has 
O(m^) columns. The set I does not contain all the surviving 
systematic nodes, hence the rebuilding is not optimal but is at 
most 1 +0(1). 

Examples Suppose 2\m. Let T = {v = {vi,...,Vm) ■ 
\\v\\i = 2,Vi = l,Vj = 1, forsomei E [l,m/2],i E 
[m/2 + l,m]} C be the set of vectors generating the code 
with r = 2 parities, hence the number of systematic nodes is 
\T\ = k = Suppose column IV = {iv^, . . . ,iVm), ivi = 

^m/2+1 = 1 erased. Define the set I = {v E T : Vi = 0}, 
and 

Z = span{z7/ — Vg\i E 1} 

for some e E L Thus \I\ = m{m — 2)/4. It can be seen that 
Z defined by the set I satisfies ( [33] l, i.e., zv — Vg ^ Z since the 
first coordinate of a vector in Z is always 0, as oppose to 1 for 



the vector w — v^. Define u = (0, 1, 1) and Xq = (m)^- 
is easy to check that m _L Z and u ■ {w — v^) =1 7^ 0. Hence, 
the conditions in TJ]eoreml29lare satisfied and rebuilding can be 
done using Xq. Moreover by CoroJJarv[30l the rebuilding ratio 
is at most 

1 1 (m/2) - 1 _ 1 1 

2 2 (m2/4) +1 ^2^m' 

which is a little better than Theorem [T6l in the constants. Note 
that by similar coefficients assignment of Construction [4] we 
can use a field of size 5 or 8 to assure the code will be an MDS 
code. 

C. Minimum Number of Erasures with Optimal Rebuilding 

Next we want to point out a surprising phenomena. We say 
that a set of vectors S satisfies property e for e > 1 if for any 
subset A C S of size e and any u E A, 

u — vi span{ro — v:wE S\A}, 

where v E S\A. Recall that by Theorem [27] any set of 
vectors that generates a code C and can rebuild optimally any 
e erasures, satisfies property e. The following theorem shows 
that this property is monotonic, i.e., if S satisfies property e 
then it also satisfies property a for any e < a < | S | . 

Theorem 31 Let S be a set of vectors that satisfies property e, 
then it also satisfies property a, for any e < fl < | S | . 

Proof: Let A C S,\A\ = e + 1 and assume to the 
contrary that u — v E span{ro — v : w E S\A} for some 
u E A and v E S\A. \A\ > 2 hence there exists x E A\u. 
It is easy to verify that u — v E span{w — v : w E S\A*}, 
where A* = A\x and \A*\ = e which contradicts the property 
e for the set S. ■ 

Hence, from the previous theorem we conclude that a code 
C that can rebuild optimally e erasures, is able to rebuild 
optimally any number of erasures greater than e as well. 
However, as pointed out already there are codes with r parities 
that can not rebuild optimally from some e < r erasures. 
Therefore, one might expect to find a code C with parameter 
e* > 1 such that it can rebuild optimally e erasures only 
when e* < e < r. For example, for r = 3, m = 2 let C 
be the code constructed by the vectors {0,ei,e2,ei +£2}- We 
know that any code with more than 3 systematic nodes can 
not rebuild one erasure optimally, since the size of a family of 
orthogonal permutations over the integers [0, 3^ — 1] is at most 
3. However, one can check that for any two erased columns, 
the conditions in Theorem [28] are satisfied hence the code can 
rebuild optimally for any e = 2 erasures and we conclude that 
e* =2 for this code. 

The phenomena that some codes has a threshold parameter 
e*, such that only when the number of erasures e is at least as 
the threshold e* then the code can rebuild optimally, is a bit 
counter intuitive and surprising. This phenomena gives rise to 
another question. We know that for a code constructed with 
vectors from F™, the maximum number of systematic columns 
for optimal rebuilding of e = 1 erasures is m + 1 (Theorem 
l2n i. Can the number of systematic columns in a code with 
an optimal rebuilding of e > 1 erasures be increased? The 



20 



previous example shows a code with 4 systematic columns 
can rebuild optimally any e = 1 erasures. But Theorem |2T| 
shows that when r = 3, m = 2, optimal rebuilding for 1 
erasure implies no more than 3 systematic columns. Hence 
the number of systematic columns is increased by at least 1 
compared to codes with 9 rows and optimal rebuilding of 1 
erasure. The following theorem gives an upper bound for the 
maximum systematic columns in a code that rebuilds optimally 
any e erasures. 

Theorem 32 Let C be a code constructed by Construction |5] 
and vectors from F™ . IfC can rebuild optimally any e erasures, 
for some 1 < e < r, then the number of systematic columns k 
in the code satisfies 

k < m + e. 

Proof: Consider a code with length k and generated by 
vectors cq/ ^1/ ■ • ■ / '^fc-l- If these vectors are linearly inde- 
pendent then k < m and we are done. Otherwise they are 
dependent. Suppose e columns are erased, 1 < e < r. Let 
Vf, be a surviving column. Consider a new set a of vectors: 
T = {vj - Ve : i E [0,k - =^ e}. We know that the 
code can rebuild optimally only if ( [33T l is satisfied for all 
possible e erasures. Thus for any i e, i E [0,k — 1], if 
column i is erased and column e is not, we have Vj — Vf, ^ Z 
and thus Vj — 7^ 0. So every vector in T is nonzero. 
Let s be the minimum number of dependent vectors in T, 
that is, the minimum number of vectors in T such that they 
are dependent. For nonzero vectors, we have s > 2. Say 
{Ve+l ~ ^e/ — '^e, ■ ■ ■ , ^e+s — ^e} is a minimum depen- 
dent set of vector Since any m + 1 vectors are dependent in 
F™, 

s < m + 1. 

We are going to show k — e < s — 1. Suppose to the contrary 
that the number of remaining columns satisfies k — e > s 
and e erasures occur. When column v^+s is erased and the 
s columns {v/,, Ce+i, . . . , C(.+s-l} are not, we should be able 
to rebuild optimally. However since we chose a dependent 
set of vectors, Vg+s — Ce is a linear combination of {Cg+i — 
'Ve> — '^e,---, ^'e+s-l ~ '^e}, whose span is contained in Z 
in ( I33] ). Hence ( |33] | is violated and we reach a contradiction. 
Therefore, 

k — e<s — l<m. 

■ 

Notice that this upper bound is tight. For e = 1 we 
already gave codes with optimal rebuilding of 1 erasure and 
k = m + 1 systematic columns. Moreover, for e = 2 the 
code already presented in this section and constructed by the 
vectors 0, ei, 62, ei + 62, reaches the upper bound with k = 4 
systematic columns. 

D. Generalized Rebuilding Algorithms 

The rebuilding algorithms presented in Constructions I1I5I 
and Theorem [28] all use a specific subspace and its cosets 
in the rebuilding process. This method of rebuilding can be 
generalized by using an arbitrary subspace as explained below. 



Let T = {vq, . . . , v^_i] be a set of vectors generating the 
code in Construction |5] with r'" rows and r parities. Suppose 
e columns [0, e — l] are erased. Let Z be a proper subspace of 
FJ". In order to rebuild the erased nodes, in each parity column 
I E [0, r — 1], access the zigzag elements z| for i E X;, and 
X; is a union of cosets of Z. In each surviving node, access 
all the elements that are in the zigzag sets X/ of parity I. 
More specifically, access element fl, y in the surviving column 
j E [e,k — 1] if i + Ivj E X;. Hence, in the surviving column 
i and parity /, we access elements in rows X; — Ivj. In order 
to make the rebuilding successful we impose the following 
conditions on the sets Xq, X;. Since the number of equations 
needed is at least as the number of erased elements, we require 

'£|X,|=er'". (39) 

1=0 

Moreover, we want the equations to be solvable, hence for any 
erased column i E [0, e — 1], 

Uj'lo ~ = t*^' ~ 1] multiplicity e, (40) 

which means if the union is viewed as a multi-set, then each 
element in [0, r™ — 1] appears exactly e times. This condition 
makes sure that the equations are solvable by Hall's theorem 
(see Theorem |28] |. Under these conditions we would like to 
minimize the ratio, i.e., the number of accesses which is, 

k-l 

^ini? Elu;=o(X'-^^y)l- (41) 

' ' ' ]=e 

In summary, for the generalized rebuilding algorithm one 

first chooses a subspace Z, and then solves the minimization 
problem in dTTT) subject to ( |39] | and ( |40] |. 

The following example interprets the minimization problem 
for a specific case. 

Example 6 Let r = 2,e — 1, i.e., two parities and one erasure, 
then equations (I39]l,(l40b becomes 

|Xo I + |Xi I = 2'", Xo U Xi + 1^0 = [0, 2'" - 1] . 

Therefore Xi + Vq = Xq. The objective function in ( |4T] ) 
becomes, 

k-l k-l 

min J2 1^0 U Xi + Z7y| = min ^ [Xq U (Xq + i^o + I- 

Each Vq + Vj defines a permutation fvo+vj on the cosets ofZ by 
fvg+Vj {-A) = A + Vq + Vj for a coset A of Z. If vq + VjEZ 
then fvQ+vj is the identity permutation and \Xq U (Xq + Vq + 
Vj)\ = 2™, regardless of the choice of Xq. However, if Vq + 
Vj i Z, then /vg+Vj is of order!, i.e., it's composed of disjoint 
cycles of length 2. Note that if fvg+Vj maps A to B and only 
one of the cosets A, B is contained in Xq, say A, then only A 
is contained in Xq U (Xq + Vq + Vj). On the other hand, if both 
A,B E XQorA,B i Xq then, 

A, EC XQU(X^ + VQ + Vj). 

In other words, {A, B) is a cycle in fvg+Vj which is totally 
contained in Xq or in Xq. Define as the number of cycles 
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{A, B) in the permutation fvg+v that are totally contained in X 
or in X, where X is a union of some cosets of Z. It is easy to 
see that the minimization problem is equivalent to minimizing 

k-l 

min y N]^. (42) 

In other words, we want to find a set X which is a union of 
cosets ofZ, such that the number of totally contained or totally 
not contained cycles in the permutations defined by Vj + vq, 
i E \l,k — 1] is minimized. 

From the above example, we can see that given a non- 
optimal code with two parities and one erasure, finding the 
solution in (l42l i requires minimizing for the sum of these 
k — 1 permutations, which is an interesting combinatorial 
problem. Moreover, by choosing a different subspace Z we 
might be able to get a better rebuilding algorithm than that in 
Construction [T] or Theorem |28] 

IX. Concluding Remarks 

In this paper, we described explicit constructions of the 
first known systematic {n,k) MDS array codes with n — k 
equal to some constant, such that the amount of information 
needed to rebuild an erased column equals to l/{n — k), 
matching the information-theoretic lower bound. While the 
codes are new and interesting from a theoretical perspective, 
they also provide an exciting practical solution, specifically, 
when n — k = 2, our zigzag codes are the best known 
alternative to RAID-6 schemes. RAID-6 is the most prominent 
scheme in storage systems for combating disk failures |[T]- 
JS). Our new zigzag codes provide a RAID-6 scheme that has 
optimal update (important for write efficiency), small finite 
field size (important for computational efficiency) and optimal 
access of information for rebuilding - cutting the current 
rebuilding time by a factor of two. 

We note that one can add redundancy for the sake of 
lowering the rebuilding ratio. For instance, one can use three 
parity nodes instead of two. The idea is that the third parity is 
not used for protecting data from erasures, since in practice, 
three concurrent failures are unlikely. However, with three 
parity nodes, we are able to rebuild a single failed node by 
accessing only 1/3 of the remaining information (instead of 
1/2). An open problem is to construct codes that can be 
extended in a simple way, namely, codes with three parity 
nodes such that the first two nodes ensure a rebuilding ratio of 
1/2 and the third node further lowers the ratio to 1/3. Hence, 
we can first construct an array with two parity nodes and when 
needed, extend the array by adding an additional parity node 
to obtain additional improvement in the rebuilding ratio. 

Another future research direction is to consider the ratio of 
read accesses in the case of a write (update) operation. For 
example, in an array code with two parity nodes, in order 
to update a single information element, one needs to read at 
least three elements and write three elements, because we need 
to know the values of the old information and old parities 
and compute the new parity elements (by subtracting the old 
information from the parity and adding the new information). 



However, an interesting observation, in our optimal code 
construction with two parity nodes, is if we update all the 
information in the first column and the rows in the first half of 
the array (see Figure O, we do not need to read for computing 
the new parities, because we know the values of all the 
information elements needed for computing the parities. These 
information elements take about half the size of the entire 
array. So in a storage system we can cache the information 
to be written until most of these elements needs update (we 
could arrange the information in a way that these elements are 
often updated at the same time), hence, the ratio between the 
number of read operations and the number of new information 
elements is relatively very small. Clearly, we can use a similar 
approach for any other systematic column. In general, given 
r parity nodes, we can avoid redundant read operations if we 
update about 1 /r of the array. 

Appendix A 
Proof of Theorem|5] 

In order to prove Theorem |5] we use the well known 
Combinatorial NuUstellensatz by Alon ||2T1 : 

Theorem 33 (Combinatorial NuUstellensatz) [2T\ Th 1.2] Let 

F be an arbitrary field, and let f = /(xj,..., x^) be a poly- 
nomial in F[xi, Xq]. Suppose the degree of f is deg(/) = 
EJ=1 where each ti is a nonnegative integer, and suppose the 
coefficient ofYll^^ x[' in f is nonzero. Then, if Si,...,Sn are 
subsets ofF with |S,| > tj, there are Si E Si,S2 G S2, £ 
Sq SO that 

f{si,...,Sq) 7^ 0. 

Proof of Theorem |5} Assume the information of A is 
given in a column vector W of length pk, where column 
/ e [0,A:— 1] of A is in the row set [(z'p, (z + l)p — 1] 
of W. Each systematic node i, i G [0, fc — 1], can be 
represented as Q,W where Q,- = [Opxp//-fpxp,Opxp(fc-/-i)]- 
Moreover define = [IpxpJpxp, -Jpxp], Qk+l = 

[xqPo/ ^l^'l/ •••/ ^/c-i^\-i] where the P;'s are permutation ma- 
trices (not necessarily distinct) of size p x p, and the X/'s 
are variables, such that Cjt = Q/tW, C/c+i = Q/c+iW. The 
permutation matrix P,- = (pj'j,) is defined as pj'j^ = 1 if 
and only if / G Z;. In order to show that there exists 
such MDS code, it is sufficient to show that there is an 
assignment for the intermediates {x,} in the field F, such that 
for any set of integers {si, S2, Sj-} C [0, fc+ 1] the matrix 
Q = [QJj, QJ^, QJJ is of full rank. It is easy to see that if 
the parity column C/t+i is erased i.e., k + 1 ^ {si,S2, Si-} 
then Q is of full rank. If k ^ {si,S2, ...,Sj-} and k + 1 E 
{si,S2, ...,s^} then Q is of full rank if none of the x,'s equals 
to zero. The last case is when both k,k + l E {si,S2, ...,sj;}, 
i.e., there are < f < / < A: — 1 such that ^ {s-[,S2, ■■■,sif}. 
It is easy to see that in that case Q is of full rank if and only 
if the submatrix 

B, . = ( 'i^' ) 

''^ \ ^pxp Ipxp J 

is of full rank. This is equivalent to det(B,y) 7^ 0. Note that 
deg(det(By)) = p and the coefficient of x[' is det(P/) E 
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{1,-1}. Define the polynomial 

T = T{xo,xi,...,xi,_-^) ^ Yl det(By), 

0<!<;<i:-l 

and the result follows if there are elements ao,a\, G F 

such that r(flo/fli/ 7^ 0. T is of degree and the 

coefficient of nf=o is itiZ] det(Pi)''"^"' ^ 0. Set 

for any i, S, = F\0 in Theorem [33] and the result follows. ■ 



Appendix B 
Proof of Theorem|6] 

In this section we prove Theorem |6] We will need some 
definitions and theorems first. 

For the rebuilding of node / by row and zigzag sets S = 
{Sq, . . . , Sp_i}, define the number of intersections by 

l{i\S) = ^ |S| - I Uses S\ = pk-\ Uses S|. 
SgS 

Moreover define the number of total intersections in an MDS 
array code C as 

k-l 



^(C)= E max ,7(f|S). 



1=0 



S rebuild.s i 



Now define h{k) to be the maximal possible intersections over 
all [k + 1, k) MDS array codes, i.e.. 



max /(C). 

C 



For example, in Figure|2]the rebuilding set for column 1 is S = 
{Ro/^l/-Zo/-Zi}, the size in equation (HI is 8, and -f(l|S) = 4. 

The following theorem gives a recursive bound for the 
maximal number of intersections. 

Theorem 34 Letq <k<p thenh{k) < ^^^(^3^- 

Proof: Let A be an information array of size p x k. 
Construct a MDS array code C by the row sets and the 
zigzag sets that reaches the maximum possible number of 
intersections, and suppose S' achieves the maximal number of 
intersections for rebuilding column i, i G [0, A: — 1]. Namely 
the zigzag sets Z of the code C and the rebuilding sets S' 
satisfy that, 

M^) = /(C) = E max .I(z|S) = xi^('lS')- 

For a subset of columns T C [0, fc — 1] and a rebuilding 
set S' = {Sq, Sp_i} we denote the restriction of S' to 
T by Sj = {So,T/ •■•/ where Sij = {« e S; : 
a is in columns T}. Denote by 



J(;,S')= E |S/n/| 



uL"oS;)n/| 



the number of intersections in column / while rebuilding 
column / by S'. It is easy to see that 



and thus 

h{k)= ^ 7(;,S'). 

Note also that if i ^ j and i,] G T, then 

I{j,S') = I{j,S'j). (43) 

Hence 

::^m.)^(:-)e'(/,so 

= E E n/,s') 

TC[0,A:-1]: 
/t^! i,jeT,\T\=q 

= E E nu^T) 

',]■■ TC[0,J:-1]: 
i,jGT,\T\=q 

= E E ^O'S'r) 

TC[0,*:-1]: i,jeT: 

< E ^('?) (44) 

TC[0,*:-1]: 
1^1=9 

h{q). 

Inequality (l44l i holds because the code restricted in columns 
T is a {q + 2,q) MDS and optimal-update code, and h{q) is 
the maximal intersections among such codes. Hence, 



h{k) < 



_ k{k-\)h{q) 
q{q-l) 



and the result follows. ■ 
For a (fc + 2, k) MDS code C with p rows the rebuilding 
ratio R{C) can be written as 



m 



k{p{k-l)-I{C) + p) 



I{C) + pk 



p{k + l)k pik + l)k' 

Notice that in the two parity nodes, we access p elements 
because each erased element must be rebuilt either by row or 
by zigzag. Thus we have the term p in the above equation. 
And the ratio function for all {k + 2,k) MDS codes with p 
rows is 

h{k) + pk 



m 



minR(C) 

C 



1 



p{k + l)k' 

Proof of Theorem |6} Consider a {k + 2, k) code with 
p rows and assume a systematic node is erased. In order to 
rebuild it, p row and zigzag sets are accessed. Let x and p — x 
be the number of elements that are accessed from the first 
and the second parity respectively. W.l.o.g we can assume that 
X > |, otherwise p — x would satisfy it. Each element of these 
X sets is a sum of a set of size k. Thus in order to rebuild the 
node, we need to access at least x(fc — 1) > ^^^^ ^"^ elements 
in the k — 1 surviving systematic nodes, which is at least half 
of the size of these nodes. So the number of intersections is 
Thus 



no more than 



h{k) < 



pk{k-l) 



(45) 
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and the ratio function satisfies 



pk{k + 1 



pk{k + l) 2' 



So the rebuilding ratio is no less than 1/2. 
From Theorem [34] we get. 



, (k + l)kh(k) (k + l)h(k) 



Hence, 



R{k + 1) 



> 1 



p{k + l){k + 2) k + 2 

h{k) _ 1 
p{k-l){k + 2) ~ k + 2 
h{k) + p{k-l) 
p{k-l){k + 2) 
^ h{k) + pk 
pk{k + l) 

= m, 



(47) 



where ( |47] | follows from (05]). Thus the ratio function is 
nondecreasing. ■ 
The lower bound of 1 /2 in the theorem can be also derived 
from the repair bandwidth ([Hi. 
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